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(S4) Title: INDUCTION OF CTL RESPONSES TO FOREIGN ANTIGENS EXPRESSED IN MYCOBACTERIA 



(57) Abstract 

A method of inducing a CTL response in an animal which comprises administering to the animal mycobaacria trans- 
Tormed with at least one DNA sequence which encodes a protein or peptide or fragment or derivative thereof which includes an 
epitope which is recognized by cytotoxic T lymphocytes. The mycobacteria arc administered in an amount effeaivc to induce a 
CTL response in an animal. In one embodiment, the transforming DNA sequence may encode an HIV protein or fragment or de- 
rivative thereof. 
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INDUCTION OF CTL RESPONSES TO FOREIGN 
ANTIGENS EXPRESSED IN IIYCOBACTERIA 
This invention relates to the induction of a T-c-Il 
response, in particular a cytotoxic T lymphocyte response. More 
particularly, this invention relates to the induction of CTL 
responses to proteins or polypeptides expressed by recombinant 
mycobacteria. 

Cell-mediated immunity (or CMI) of infections is thought to 
be a major line of defense against certain infections, such as 
viral infections and certain bacterial infections. For example. 
CMI may be significant in the development of an effective vaccine 
against human immunodeficiency virus (HIV), or AIDS virus, 
because HIV vaccines and/or therapies based on the generation of 
passive transfer of HlV-speciflc antibody in the absence of 
cell-mediated Immunity have not yielded consistent protection in 
primates challenged with the HIV virus. Thus, interest has 
turned to the Induction of cell-mediated responses to various 
infections, such as for example, HIV infection, and to the 
identification of proteins or polypeptides that a stimulate a 
cytotoxic T lymphocyte response, and to methods of administering 
such proteins or polypeptides. 

In accordance with an aspect of the present invention, there 
is provided a method of inducing a CTL response in an animal 
comprising administering to the animal mycobacteria transformed 
with at least one DNA sequence which encodes a protein or peptide 
or fragment or derivative thereof which includes an epitope which 
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is recognized by cytotoxic T lymphocytes. The mycobacteria are 
administered in an amount effective to induce a CTL response in 
an animal. 

Proteins or polypeptides for which the at least one DNA 
sequence may encode, include, but are not limited to, 
Mycobacterium leprae antigens; Mycobacterium tuberculoai a 
antigens; Rickettsia antigens; Chlamydia antigens; Coxiella 
antigens; malaria sporozoite and meroroite proteins, such as the 
circumspororcite protein from Plaamodiura berohei sporozoites; 
Clostridium antigens; Leishmania antigens; Salmonella antigens; 
Mycobacterium africanun antigens; Mycobacterium intracellular*. 
antigens; Mycobacterium avium antigens; E.coli antigens; Borrelia 
antigens; Listeria antigens; Franciscella antigens; Yersinia 
antigens; Treponema antigens; Schistoatoma antigens; Filaria 
antigens; Pneumococcua antigena; Staphvlococcua antigene; Herpei 
virus antigens; influenza and parainfluenza virus antigens; 
measles virus antigens; mumps virus antigens; hepatitis virus 
antigens; Shigella antigens; Bordatella antigens; Hemophi lus 
antigens; Streptococcus antigens; polio virus antigens; Rift 
Valley Fever virus antigens; dengue virus antigens; Human 
Immunodeficiency Virus (HIV) antigens; and respiratory syncytial 
virus (RSV) antigens. 

In one embodiment, the at least one DNA sequence encodes at 
least one protein or polypeptide or fragment or derivative 
thereof which includes an eptioi>e which is recognized by 
cytotoxic T lyn^hocytes induced by an HIV protein or fragment or 
derivative thereof. The at least one DNA sequence may encode an 
HIV protein or fragment or derivative thereof. HIV proteins or 
polypeptides which may be encoded by the at least one DNA 
sequence includes but are not limited to, HIV-I-gp 120; HIV-I-gp 
41; HIV-I-gp 160; HIV-I-pol; HIV-I-nef; HIV-I-tat; HIV-I-rev; 
HIV-I-vif; HIV-I-vpr; HIV-I-vpu; HIV-I-gag; HIV-2-gp 120; 
HIV-2-gp 160; HIV-2-gp 41; HIV-2-gag; HIV-2-pol; HIV-2-nef; 
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HIV.2-tat; HIV.2-rev; HIV.2-vif; HlV-S-vpr; HIV-s-vpu- and 
HIV-2-vpx. 

Mycobacteria which may be transformed with the at least one 
DNA sequence, which encodes a protein or polypeptide or fragment 
or derivative thereof which includes an epitope which is 
recognized by cytotoxic T lymphocytes, include, but are no- 
limited to. Mycobacterium hov1s-BCC. M.,mea«.i-i, m avium ' 
?L£hlei, M.fortiutum . VLlxxfu, M. par.tuber^n ... , M.habana 
M.scrofalaceum . and M.intracellular.. m a preferred embodiment 
the mycobacterium is M.bovis.BCG or a mutant thereof. 

The at least one DNA sequence may be contained within an 
expression vector, which is transformed into a mycobacterium 
whereby the mycobacterium expresses the protein or polypeptide or 
fragment or derivative thereof which includes an epitope which is 
recognized by cytotoxic T lymphocytes. 

The expression vector may be, for example, a temperate 
shuttle phasmid or a bacterial-mycobacterial shuttle plasmid. 
Each of these vectors may be used to introduce the at least one 
DNA sequence encoding a protein or polypeptide or fragment or 
derivative which includes an epitope which is recognized by 
cytotoxic T lymphocytes, stably into mycobacteria, in which the 
at least one DNA sequence may be expressed. When a shuttle 
phasmid, which replicates as a plasmid in bacteria and a phage in 
mycobactreia. Is employed, integration of the phasmid. which 
includes the at least one DNA sequence encoding a protein or 
polypeptide, or fragment or derivative thereof, which includes an 
epitope which is recognized by cytotoxic T lymphocytes, into the 
mycobacterial chromosome occurs through site-specific 
integration. The at least one DNA sequence which encodes a 
protein or polypeptide or fragment or derivative thereof, which 
includes an epitope which Is recognized by cytotoxic 
T lymphocytes, is replicated as part of the chromosomal DNA. When 
a bacterial-mycobacterial shuttle plasmid is employed, the at 
least one DNA sequence which encodes a protein or polypeptide or 
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fragment or derivative thereof, which includes an epitope whi«-h 
is recognized by cytotoxic T lymphocytes, is stably maintained 
extrachromosomally in a plasraid. Expression of the at least one 
DNA sequence occurs extrachromosomally (e.g., episoraally) . For 
example, the at least one DNA sequence is cloned into a shuttle 
plasmid and the plasmid is Introduced into a mycobacterium such 
as those hereinabove described, wherein the plasmid replicates 
episomally. Examples of such shuttle phasmids and 
bacterial-mycobacterlal shuttle plasmids are further described in 
Application Serial No. 361,944. filed June 5, 1989. which is 
hereby incorporated by reference. 

In one embodiment the mycobacteria are transformed with an 
expression vector which comprises at least one DNA sequence 
encoding a protein or polypeptide which includes an epitope which 
is recognixed by cytotoxic T lymphocytes, and a promoter selected 
from the class consisting of mycobacterial promoters and 
mycobacterl ©phage promoters for controlling expreasion of the DNA 
encoding the heterologous protein or polypeptide, or fragment or 
derivative thereof, which includes an epitope which is recognized 
by cytotoxic T lymphocytes. 

Mycobacterial and mycobacteriophage promoters which may be 
employed include, but are not limited to. mycobacterial promoters 
such as the BCC BSP60 and HSP70 promoters; the mycobactin 
promoter from M. tubereulosle and BCC; the mycobacterial 14 kda 
and 12 kda antigen promoters; the mycobacterial o-antigen 
promoter from M. tub«rculoaia or BCC; the MBP-70 promoter, the 
mycobacterial 45 kda antigen promoter from M. tuberculosis or BCC; 
the suparoxld* disnnitaae promoter; the mycobacterial asd 
promoter, and mycobacteriophage promoters such as the Bxbl. LI. 
L5. and TN4 promoters. Zn one embodiment, the promoter is a 
mycobacterial heat shock protein promoter such as BSP60 or BSP70. 

The promoter sequence may, in one embodiment, be part of an 
expression cassette which also includes a portion of the gene 
normally under the control of the promoter. For example, when a 
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mycobacterial HSP60 or HSP70 promoter is employed ^he exn^. 
cassette may. within the scope of the preseL ^^P^^ss.o 

i tne present invention, include 
xn addataon to the promoter, a portion of the gene for the HSP60 
or HSP70 p.otein. When the expression cassette and the DNA 
encoding the protein or polypeptide, or frag„.ent or derivative 
thereof, which includes an epitope which i. recognized by 
cytotoxic T lymphocyte, are expressed, the protein expressed by 
the cassette and the DKA encoding the protein or poly^eptid! Ts , 
fusion protein of a fragment of a mycobacterial proton (eg 'h 
HSPS0 O.HSP70 protein,, and the protein or pol^epti.e o"' ' ' 
fragment or derivative thereof which include, an epitope which 
IS recognized by cytotoxic T lymphocyte.. 

In a preferred embodiment, the tran.cription initiation 
site the ribosomal binding site, and the .tart codon. which 
provides for the initiation of the tr.n.lation of mPNA. .re eacfe 

of mRNA. thereby terminating .ynthe.l. of the protein or 
polypeptide or fragment or derivative thereof which include, an 
epitope which i. recognized by cytotoxic T lymphocyte., and the 
transcription termination site, «,y be of mycobacterial origin 
or of other bacterial origin, or mich .top codon and 
transcription termination .It. may be tho.e of the DNA encoding 
the protein or polypeptide which Include, an epitope which is 
recognized by cytotoxic T lymphocyte.. 

Preferably, the mycobacterial promoter 1. a BCC promoter, 
and the mycobacterlum 1. BCC. 

In on* embodiment, the exprea.ion vector may further include 
DNA which encode, for protein, or polypeptide, .uch but not 
limited to, antigen., anti-tumor agent., enzyme., lymphoklnes. 
pharmacologic agent., immunopotentiator.. reporter molecule, of 
interest in a dl.gno.tlc context, and .electable markers. 

Selectable marker, which may be encoded Include, but are not 
limited to. the e-galacto.lda.e marker, the kanamycln resistance 
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marker, the chloroamphenicol resistance marker, the neomycin 
resistance marker, and the hygromycin resistance marker. 

In accordance with one embodiment, the vector further 
includes a mycobacterial origin of replication. 

In accordance with another embodiment, the vector may be a 
plasmid. The plasmid may be a non-shuttle plasmid, or may be a 
shuttle plasmid which further includes a bacterial origin of 
replication such as an E,coli origin of replication, a Bacillus 
origin of replication, a Staphvlococcug origin. of replication, a 
Streptomyces origin of replication, or a pneumococcal origin of 
replication. In one embodiment, the shuttle plasmid includes an 
E> coli origin of replication. 

In accordance with yet another embodiment, the vector may 
further include a multiple cloning site, and the DNA encoding fpr 
the protein or polypeptide, or fragment or derivative thereof, 
which includes an epitope which is recognized by cytotoxic 
T lymphocytes is inserted in the multiple cloning site. 

In addition to the DKA encoding a heterlogous protein or 
polypeptide, and the mycobacterial promoter for controlling 
expression of the DNA encoding the protein or |>olypeptlde which 
includes an epitope which is recognized by cytotoxic 
T lymphocytes, the expression vector may, in one embodiment, 
further include a DNA sequence encoding bacteriophage integration 
into a mycobmcterlua chromosome. Bacteriophages from which the 
DNA sequence encoding bacteriophage integration into a 
mycobacterltim chromosome may be derived include, but are not 
limited to, aycobacteriophages such as but not limited to, the 
L5, LI, Bxbl, and TM4 mycobacteriophages; the lambda phage of E. 
coli; the toxin phages of Corvnebacteria ; phages of Actinomvcetes 
and Norcardia ; the fC31 phage of Streptomyces : and the P22 phage 
of Salmonella . Preferably, the DNA sequence encodes 
mycobacteriophage Integration into a mycobacteritua chromosome. 
The DNA sequence which encodes bacteriophage integration into a 
mycobacterium chromosome may include DNA which encodes integrase. 
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Which is . protein that provide, for integration of the vector 
into the mycobacterial chromosome. Preferably, the DNA sequence 
encoding mycobacterial phage integration also includes DNA which 
encodes an attP site. 

The DNA encoding the attP site and the integrase provides 
for an integration event which is referred to as site-specific 
integration. DNA containing the attP site and the integrase gene 
is capable of integrating into a corresponding attB site of a 
mycobacterium chromosome. 

It is to be understood that the exact DNA sequence encoding 
the attP site may vary among different phages, and that the exact 
DNA sequence encoding the attB site may vary among different 
mycobacteria. 

Examples of expression vectors which include mycobacterial 
promoters and mycobacteriophag. promoters, for controlling the it 
least one DNA sequence encoding a protein or polypeptide, or 
fragment or derivative thereof, which Include. «n epitope which 
is recognized by cytotoxic T lymphocyte, .re further de.crlbed in 
.pplicatlon S.ri.1 No. 642,017, filed Jenu.ry 16, 1991, which is 
a continuation of Application Serial No. 552,828. filed July 16. 
1990, now abandoned. The contents of Application Serial No. 
642.017 are hereby Incorporated by reference. 

In another embodlBent, the mycobacteria are transformed with 
DNA which conpri... . flr.t DNA ..qu.nce which 1. . ph.g. DNA 
portion encoding bactcrlophege Integration into . mycob.cterium 
chromosome, mad DNA Including the .t le.st one DNA sequence 
encoding . protein or polypeptide, or fr.gncnt or derivative 
thereof, which Include, .n epitope which 1. recognized by 
cytotoxic T lymphocyte.. 

The term "ph.ge DNA portion", .. used herein, means that the 
DNA sequence 1. derived from . ph.ge .nd l.ck. the DNA which is 
required for ph.ge repllc.tlon. 

B.cterioph.gea from which the phege DNA portion may be 
derived include, but are not limited to, mycobacteriophage.. such 
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as but not limited to the those hereinabove described. 
Preferably, the phage DNA portion encodes mycobacteriophage 
integration into a rnycobacterium chromosome. 

In a preferred embodiment, the first DNA sequence includes 
DNA encoding integrase, which is a protein that provides for 
integration of the DNA into the mycobacterial chromosome. Most 
preferably, the first DNA sequence also includes DNA which 
encodes an AttP site. 

The DNA sequence encoding the AttP site and the integrase 
provides for an integration event which is referred to as 
site-specific integration. DNA containing the AttP site and the 
integrase gene is capable of integration into a corresponding 
AttB site of a rnycobacterium chromosome. 

It is to be understood that the exact DNA sequence encoding 
the attP site may vary among different phages, and that the exact 
DNA sequence encoding the attB site may vary among different 
mycobacteria. 

The integration event results in the formation of two new 
junction sites called AttL and AttR, each of which contain part 
of each .of AttP and AttB. The inserted and integrated non-phage 
DNA which includes the first DNA sequence and the at least one 
DNA sequence, which encodes a protein or polypeptide, or fragment' 
or derivative thereof, which includes an epitope which is 
recognized by cytotoxic T lymphocytes is flanked by the AttL and 
AttR sites. The inaertlon and integration of the phage DNA 
portion results in the formation of a transformed rnycobacterium. 

The ONA may further include DNA which encodes a selectable 
marker or markers; or other proteins or polypeptides of interest, 
such as, but not limited to anti -tumor agents, enzymes, 
lymphokines, pharmacologic agents, immunopotentiators, and 
reporter molecules of interest in a diagnostic context. 

Selectable markers which may be encoded include, but are not 
limited to, the kanamycin resistance marker, the neomycin 
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resistance marker, the chloroamphenicol resistance marker, and 

the hygromycin resistance marker. 

The phage DNA portion of the present invention whi-h 

includes the first DNA sequence encoding mycobacterium phage 

integration into a mycobacterium chromosome, and the at least one 
DNA sequence encoding a protein or polypeptide, or fragment 
derivative thereof, which includes an epitope recognized by * 
cytotoxic T lymphocytes, may be constructed through genetic 
engineering techniques known to those skilled in the art in a 
preferred embodiment, the phage DNA portion may be a plasmid ^ 
including, in addition to the DNA encoding integration and the 
DNA encoding a protein or polypeptide, or fragment or derivative 
thereof, which includes an epitope recognized by cytotoxic 
T lymphocytes, an origin of replication for any of a wide variety 
of organisms, which includes, but is not limited to. E.coli 
Streptomyces species. Bacillus species. Staohvlo.o..,.. species 

"Pecies. Salmonella specie, and various species of 
pneumococci. Most preferably, the plasmid includes an origin of 
replication for E.coli . 

The phage DNA portion also may include a suitable promoter 
suitable promoter. Include, but are not limited to. mycobacterial 
promoters and mycobacterlophage promoters such as those 
hereinabove described. 

The promoter sequence may, in one embodiment, be part of an 
expression caaaette which also Includes a portion of the gene 
normally under the control of the promoter, as hereinabove 
described. For example, when a mycobacterial HSP60 or HSP70 
promoter 1. employed, the expression cassette may Include, in 
addition to the promoter, a portion of the gene for the HSP60 or 
HSP70 protein. When the expression cassette and the DNA encoding 
the protein or polypeptide, or fragment or derivative thereof, 
which includes an epitope which is recognized by cytotoxic T 
lymphocytes are expressed, the protein expressed by the cassette 
and the DNA encoding the protein or polypeptide Is a fusion 
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protein of a fragment of a mycobacterial protein (eg., the HSP60 
or HSP70 protein), and the protein or polypeptide, or fragment or 
derivaitve thereof, which includes an epitope which is recognized 
by cytotoxic T lymphocytes. 

In a preferred embodiment, the transcription initiation 
site, the ribosomal binding site, and the start codon. which 
provides for the initiation of the translation of mRNA, are each 
of mycobacterial origin. The stop codon, which stops translation 
of mRNA, thereby terminating synthesis of the protein or 
polypeptide, or fragment or deriviatve thereof, which includes an 
epitope which is recognized by cytotoxic T lymphocytes, and the 
transcription termination site, may be of mycobacterial origin, 
or of other bacterial origin, or such stop codon and 
transcription termination site may be those of the DNA encoding 
the protein or polypeptide, or fragment or derivative thereof, 
which includes an epitope which is recognized by cytotoxic 
T lymphocytes. 

Examples of DNA which Includes a first DNA sequence which is 
a phage DNA portion encoding bacteriophage Integration into a 
mycobacterium chromosome, and DNA including the at least one DNA 
sequence encoding a protein or polypeptide, or fragment or 
derivative thereof, which includes an epitope which is recognized 
by cytotoxic T lymphocytes are further described in application 
Serial No. 553,907. filed July 16, 1990, the contents of which 
are hereby incorporated by reference. 

Mycobacteria which are transformed which DNA which encodes 
for a protein or polypeptide or fragment(8) or derivative ( s ) 
thereof, which includes an epitope which. is recognized by 
cytotoxic T lya^hocytes. may be employed in a composition, such 
as a vaccine, for inducing a CTL response in an anlaml. The 
vaccine may be administered to a htiman or non-human animal. 

To form such a vaccine, the transformed mycobacteria are 
administered in conjunction with a suitable pharmaceutical 
carrier. As representative examples of suitable carriers there 
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be mentioned: mineral oil. alum. ,y„theti= polymers, e- 
Vehxcle, tor v.c=i„e. .re well known 1„ the ert and the ,ele"-<or 
Of a suitable vehicle i, deemed to be within the .cope of thos'I 
Skilled in the art from the teachin,, contained her.L 1^' 
selection of a suitable vehicle is .i.o dependent upon the manner 
.n Which the vaccine i. to be administered. The valine :arb: 
in the form of an injectable dose .„d m.y fc. administered 
intramuscularly, intravenously, orally, intrademally, or bv 
subcutaneous administration. 

other means for administering the vaccine or therapeutic 
agent should be apparent to those slcilled in the art from the 
teachings herein; accordingly, the scope of the invention is not 
to be limited to a particular delivery form. 

When the transformed mycobacteria are employed as a vaccine 
such a vaccine ha. in^rtant advantage- over other presently 
available vaccines. Mycobacteria have, a. hereinabove indicated 
adjuvant properties among the best currently known and 
therefore, stimulate a recipient's Immune system to respond with 
great effectiveness. This aspect of the vaccine induces 
cell-mediated immunity and thus is especially useful in providing 
immunity against pathogens in case, where cell-mediated immunity 
appears to be critical for resistance. Also, mycobacteria may 
stimulate long-term memory or immunity. It thus maybe possible 
to prime long-lasting T cell memory, which stimulates secondary 
antibody responses neutralixing to the infectious agent. Such 
priming of T cell memory is useful, for example, against 
pertussis, malaria, influenza virus. Herpes virus, rabies. Rift 
Valley fever virus, dengue virus, measles virus. Human 
Immunodeficiency Virus (HIV), and respiratory syncytial virus. 

The invention will now be described with respect to the 
following examples; however, the scope of the present invention 
la not to be limited thereby. 

Example 1 



wo 92/2 1 376 PCT/US92/04538 

-12- — 

A. Constru ction of plasmid including wvcobacterial promot-^r 
expression cassette and lacZ oeng. 
1, Construction of PYUB125 

Plasmid pALSOOO, a plasmid which contains an origin of 
replication of M, fortuttum , and described in Labidi, et al., 
FEMS Microbiol, Lett ., Vol. 30, pgs. 221-225 (1985) and in Gene . 
Vol. 71, pgs. 315-321 (1988), is subjected to a partial Sau 3A 
digest, and 5kb fragments are gel purified. A 5kb fragment is 
then ligated to Bam HI digested pIJ666 (an. E. coli vector 
containing an E. coli origin of replication and also carries 
neomycin-kanamycln resistance, as described in Kieser, et al.. 
Gene, Vol. 65, pgs. 83-91 (1988) to form plasmid pYUB12. A 
schematic of the formation of plasmid pYUB12, A schematic of the 
formation of plasmid pYUB12 is sho%m in Figure 1. pYUB12 and 
pIJ666 were then transformed into M. smegma tis and BCC. 
Neomycin-resistant transformants that were only obtained by 
pYUB12 transformation confirmed that pALSOOO conferred autonomous 
replication to pXJ666 in M. smeqmatis and BCC. 

Shotgun mutagenesis by Snapper, et al (1988, hereinabove 

cited) indicated that no more than half of the pALSOOO plasmid 

was necessary to support plasmid replication in BCC. This 

segment presumably carried open reading frames ORFl and 0RF2, 

identified by Rauzler, et al.. Gene , Vol. 71, pgs. 315-321 

(1988), and also prsstinably carried a mycobacterial origin of 

replication. pyUB12 is then digested with Hpal and EcoRV, a 2586 

bp carrying this rsgion or segment pALSOOO Is removed and ligated 

to PvuII digested pYUB8. Plasmid pYUBB (a pBR322 derivative) 

o 

includes an E. coli replicon and a kan (aph) gene. Ligation of 
the 2586 bp pyUB12 fragment to PvuXI digested pYUBB results in 
the formation of pYUBSS, as depicted in Figure 2. Transformation 
of pYUB53 confirmed that the EcoRV-Hpal fragment, designated 
M.rep, was capable of supporting autonomous replication in BCC. 

Plasmid pYUB53 was then digested with AatI, EcoRV, and PstI 
in order to remove the following restriction sites: 
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A«tl 5707 

EcoRI 5783 

BanHI 5791 

S«1I 5797 

PstI 5803 

PstI 7252 

Sail 7258 
BainHI 7264 
EcoRI 7273 
Clal 7298 
Hindi I I 7304; and 
EcoRV 7460 

Fragment ends are then flushed with T4 DNA polymerase and 
reUgated to fona placid pYUB125. construction of which is shown 
in Figure 3. 

1^ Plinination of superfluous v«>r tor DMA fypm dYUB12S 

792 bases of the tet gene, which had been inactivated by 

prior manipulation., was eliminated by a complete Narl digest. 

gel purification of the 6407 bp fragment, and 

ligation/recirculation, transformation of E. coll strain HBlOl, 
and selection of Kaa'^ transformants. The construction of 
resulting plasmid, pMVlOl, is schematically indicated in Figure 
4, and the DHA sequence of pMVlOl, which includes markings of 
regions which will be deleted, and of mutations, as hereinafter 
described, is shown In Figure 5. 

^ Constnictlon ef expressio n caaaette baaed on BCC HSP60. 

Among the most abundant proteins in mycobacteria is the 
HSP60 heat shock protein (also known as the 65 kda antigen). 
Because abundance of the HSP60 protein in mycobacteria indicates 
strong HSP60 gene expression, the sequence controlling HSP60 
expression was chosen to control expression of heterologous genes 
encoding antigens or other proteins in BCG. 

The published sequence of the BCC HSP60 gene (Thole, et al. 
Infect, and Immun. . Vol. 55, pgs. 1466-1475 (June 1987)), and 
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surrounding sequence permitted the construction of a cassette 
carrying expression control sequences (i.e.. promoter and 
translation initiation sequences) by PCR. The BCC HSP61 cassette 
(Figure 6) contains 375 bases 5' to the BCC HSP60 start codon. 
and 15 bases (5 codons)-3' to the start codon. PCR 
oligonucleotide primers were then synthesized. Primer Xba-HSP60; 
of the following sequence: 

CAC ATC TAG ACC GTC ACC ACA ACC CGC C / 
was synthesized for the 5* end of the cassette. and primer 
Bam-HSP61, of the following sequence: 

CTA CGC ATC CCC AAT TCT CTT CGC CAT TC Jl 
was synthesized for the 3* end of the cassette. The primers were 
used to amplify the cassette by PCR from BCC strain Pasteur 
chromosomal DNA. The addition of the Bam HI site at the 3* end 
of the cassette adds one codon (Asp) to the first six codons of 
the HSP60 gene. 

Each of pMVlOl and the PCR cassette HSP61 was digested with 
Nhel and BaraHI. The PCR cassette was then Inserted between the 
Nhel and BaaRI sites of pMVlOl, then ligated to form plasraid 
PMV65A (Figure 7). 

The E. coli lac 2 gene (Figure 8) was used as a reporter, or 
marker gene to assay the ability of the HSP61 cassette to express 
heterologous genes in BCC. A BamHI restriction fragment carrying 
the lac Z gene was cloned into the Bam HI site of Bam HI digested 
pMVeSA, resulting in the formation of pMV65A/LZ as indicated 
schematically in Figure 7. The formation of pMV65A/LZ results in 
a fusion between the HSP60 and lac Z genes at the sixth codon of 
the HSP60 gene and the sixth codon of the lac Z gene. pMV65A/LZ 
was then transformed into E. coli . Blue E. coli colonies were 
selected on x-gal plates for the presence of pMV65A/LZ, thus 
indicating that the HSP60 promoter and translation initiation 
sequences were also active in E. coli . 

pMV65A/LZ was then transformed into BCC and plated on Dubos 
Oleic Agar plates containing x-gal. All BCC colonies resulting 
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from this transformation exhibited blue color, thus indicating 
that the 1^ gene product (B-gal.ctosidase ) w.s expressed in 
BCC. SDS polyacrylamide gel electrophoresis was performed on 
lysates of the pMV65A/LZ BCC recombinants, revealing that 
B-galactosida»e protein was expressed to levels in excess of lo-/ 
of total BCC protein (as determined by staining with Coomassie " 
brilliant blue). These data Indicated that BCC HSP61 expression 
cassette was functional in expression vector pMV6SA. 

Example ? 

Cytotoxic T Iymphocvti> response r . coll Uctosid,., 

E^_coU B-gal.ctosld««e was expressed in BCC as a six amino 
acid fusion protein with BCC hsp 60 protein using 
extrachromosomal plasmid vector pMV65A/LZ utilizing the HSP60 
promoter to drive expression. The recombinant BCC was grown ta 
mid- log phase in Dubos media and concentrated by centrifugation. 
The bacteria were then re-suspended in PBS plus 0.05% Tween 80- 
and cup sonicated briefly to disperse clumped bacteria Six week 
old BALB/c mice were inoculated with a single dose of 2 X 10^ 
2 X 10 , or 2 X 10 colony forming units (CFU'S determined 
post-inoculation) by either intradermal (ID), intraperitoneal 
(IP), or intravenous (IV) injection. At 14 or 19 weeks 
post- immunization splenocytee were harvested from mice and CTL 
activity was measured. CTL activity was measured as follows: 
Splenocytes (ACK-treated, 5 X loVml) were stimulated in 
vitro in 10 al in upright T25 flasks by co-culture for 5 days 
with mitomycin C-treated cells transfected with the lac Z gene 
(C3-4 cells; 5 X loVml). A 4 hr.^^Cr release assay was then 
performed in triplicate using P815 and P13.1 cells (P815 ceils 
transfected with the lac Z gene) as targets. Various 
effector-target ratios were tested using 5,000 targets/well. 
Specific lysis was calculated as follows: % specific lysis = 
100 X (release by effector cells minus spontaneous release/ 
maximal release minus spontaneous release). 
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At 19 weeks post-inununization, the remaining animals in eac 
group were boosted by intraperitoneal injection at 10 ug of 
purified lac 2 emulsified in incomplete Freund's adjuvant (IFA) 
Splenocytes were then harvested from these animals at 23 weeks 
and CTL activity was again measured. Unimmunized animals or 
animals immunized with lac 2 emulsified in IFA, or animals 
immunized with vaccinia virus expressing lac 2 served as 
controls. 

The results of the avove experiments, as determined by % 
specific lysis of target cells, indicated that a CTL response' wa 
induced in mice immunized with BCC transformed with the 
expression vector pMV65A/L2. 

Example 3 

Construction of integrating plasmld including mvcobacteri«T 
promoter exnression c assette and HIV-I»gp 120 gene . 

^- Elimination of und esirable restriction sites In aph rk«n^) 
gene . 

To facilitate future manipulations, the Hindlll and Clal 
restriction sites in the aph gene In plasmid pMVlOl were 
mutagenized siraultaneously by polymerase chain reaction (PCR) 
mutagenesis according to the procedure described in Gene . Vol. 77 
pgs. 57-59 (1989). The bases changed in the 'aph gene were at the 
third position of codons (wobble bases) within each restriction 
site and the base stibstltutions made were designed not to change 
the amino acid sequence of the encoded protein. 

Separate PCR reactions of plasaid pHVlOl with primer 
ClaNut-KsB * HindRMut-Ran and RindFMut-Kan ^ Ban-Kan were 
performed at 90«C (1 min.). 50*C (1 min. ), and 72*C (1 min. ) for 
25 cycles. The PCR primers had the following base seqeuences: 

ClaMut-Kan 

CTT CTA TOO CAA CCC CC 3 
HindRMut-Kan 

GTG AGA ATG CCA AAA GAT TAT GCA TTT CTT TCC AC t 
HindFWut-Kan 
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G7G TGC AAA GAA ATG CAT AAT CTT TTG CCA TTC TCA CCC G r 
Bam- Kan 

CC7 AGA GCA TCC AGA GCA CG ^ 
The resulting PGR products were gel purified and mixed and a 
single PCR reaction without primers was performed at 94»C (1 
min.). 72«C (1 min. ) for 10 cycles. Primers ClaMut-Kan and 
Barn-Kan were added and PCR was resumed at 94«C (1 min.) SO^C (l 
min.). and 72«C (2 min.) for 20 cylcea. The resulting PCR 
product (Kan. mut) was digested with BamHI and gel purified 
Plasmid pMVlOl was digested with Cl.I and cohesive ends were 
filled in by Klenow * dCTP * dCTP. Klenow was heat inactivated 
and the digest was further digested with BamHI. The 5232 base 
pair fragment was gel purified and mixed with fragment Kan. mut 
and ligated. ^The ligation was transformed into E. coll strain 
HBlOl and Kan colonies were screened for plasmids resistant to 
Clal and Hindi 1 1 digestion. Such plasmids were designated as 
pMVllO. which is depicted in Figure 4. 

— Elimination of aeouencea not neeeaaarv for plaamld 

replication In mycobacteria. 

Plasrald pMVllO was resected in separate constructions to 
yield plasmids pMVlll and pMV112. In one construction, pMVllO 
was digested with Narl and Ball, the ends were filled In. and a 
5296 base pair fragment was ligated and reclrcularlzed to form 
pMVlll. In another construct. pMVllO was digested with Ndel and 
SplI, the ends were filled in, and a 5763 base pair fragment was 
ligated and reclrcularlzed to form pMV112. Schematics of the 
constructions of pMVlll and pMV112 are shown In Figure 9. These 
constructions further eliminated superfluous E. coll vector 
sequences derived from pAL5o6o not necessary for mycobacterial 
replication. Cloning was performed in E. coll . Plasmids pMVlll 
and pMV112 were tested for the ability to replicate in Mj. 
smectmatia. Because both plasmids replicated In M. smeomatls the 
deletions of each plasmid were combined to construct pMV113. 
(Figure 9). 
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To construct pMV113. pMVlU was digested with BamHI and 
EcoRI, and a 1071 bp fragment was isolated. pMvll2 was digested 
with BamHI and EcoRI, and a 3570 bp fragment was isolated, and 
then ligated to the 1071 bp fragment obtained from pMVlll'to form 
PMV113. These constructions thus defined the region of pALSOOO 
necessary for autonomous replication in mycobacteria as no larger 
than 1910 base pairs. 

^ Mutagenesis of reatrir-tion sit^,. t n mvcQbi„.i->^.i ,i 

replicon. 

To facilitate further manipulations of the mycobacterial 
replicon, PGR mutagenesis was performed as above to eliminate the 
Sal I. EcoRI. and Bglll sites located in the open reading frame 
known as ORFl of pALSOOO. PGR mutagenesis was performed at 
wobble bases within each restriction site and the base 
substitutions were designed not to change the amino acid seguenoe 
of the putative encoded ORFl protein. The restriction sites were 
eliminated one at a time for testing in mycobacteria. It was 
possible to eliminate the Sail and EcoRI without altering 
replication in M. smegma ti a. in one construction PGR mutagenesis 
was performed at EcoRI 1071 of pMV113 with primers Eco Mut - M.rep 
and Bam-M.rep to form pMV117, which lacks the EcoRI1071 site. 
Primer Eco Mut - M.rep has the following sequence: 

TCC GTC CAA GGA CTC TCC CCG A; 7 

and Bam-M.rep has the following sequence: 

GAG CCG TCC TCT CCA TCC TCT AC. ^ 
In another construction, PGR mutagenesis was performed at 
the Sail 1J89 site with primer Sal Mut - M.rep and Bam-M.rep to 
form pMV119, which lacks the Sail 1389 site. Primer Sal Mut- 
M.rep has the following sequence: 

TCC CCA CCC GAG TTA CTC AGO CGT. f 
pMV117 was then digested with ApaLI and Bglll, and a 3360 bp 
fragment was isolated. pMV119 was digested with ApaLI and Bglll, 
and a 1281 bp fragment was isolated and ligated to the 3360 bp 
fragment isolated from pMV117 to form pMV123. A schematic of the 
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constructions of pl.smids pMVii7. pMvll9. and pMV123 is shown 
Figure 10. Elimination of the Bglll site, however, either by P^r 
mutagenesis or Klenov fill m. eliminated plasmid replication in 
mycobacteria, thus suggesting that the Bglll site is in proximity 
to, or within a sequence necessary for mycobacteria plasmid 
replication. 

^ Construct ion of dMV^OO sari.*, vector*! 

To facilitate manipulations of all the components necessary 
for plasmid replication in E^coli and mycobacteria, (E. rep and 
M. rep. ) and selection of recombinants (Kan^) , cassettes of each 
component were constructed for simplified assembly in future 
vectors and to include a multiple cloning site (MCS) containing 
unague restriction sites and transcription and translation 
terminators. The cassettes were constructed to allow directional 
cloning and assembly into a plasmid where all transcription is 

unidirectional, 
p 

Kan Caaaett* 

A DNA cassette containing the aph (Kan^) gene was 
constructed by PCR with primers KanS' and Kan3'. An Spel site 
was added to the 5* end of the PCR primer Kan3', resulting in the 
formation of a PCR primer having the following sequence: 

CTC GAC TAG TCA OCT CTG OCT CGT CAA G. /O 

Bam HI ♦ Nhel sites were added to the 5' end of the primer 
Kan5', resulting in the formation of a PCR primer having the 
following aeq^uence: 

CAC AGO ATC CTT AGC TAG CCA CT GAC CTC GCC G. // 

PCR was perforaed at bases 3375 and 4585 of pffV123, and 
BamHI and Nh«Z sites were added at base 3159, and an Spel site 
was added at base 4585. Digestion with BamHI and Spel, followed 
by purification resulted in a 1228/2443 Kan^ cassette bounded by 
BamHI and Spel cohesive ends with the direction of transcription 
for the aph gene proceeding from BamHI to Spe I. 

E . rep . cassette 
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A DNA cassette containing the ColEI replicon of pUC19 was 
constructed by PGR with primers E.rep/Spe and E.rep/Mlu. An Spel 
site was added to the 5' end of PGR primer E.rep/Spe and an Mlul 
site was added to the 5* end of PGR primer E.rep,/Mlu. The 
resulting primers had the following sequences: 

E , rep , /Spe 

CCA CTA CTT GGA GTG ACC GTG ACA CGC 1^ 
E. rep./Mlu 

GAG AAG GGC TTC GCG TGC GTG CTT CGC GTG. \S 
PGR was performed at bases 713 and 1500 of pUG19, and an 
Nlul site was added to base 713, and a Spel site was added to 
base 1500. Digestion with Mlul and Spel, followed by 
purification resulted in an E.rep. cassette bounded by Spel and 
Mlul cohesive ends with the direction of transcription for RNA I 
and RNA II replication primers proceeding from Spel to Mlul. 
M. rep, cassette 

A DNA cassette containing sequences necessary for plasmid 
replication in mycobacteria was constructed by PGR of pMV123 with 
primers M.rep/Nlu and M. rep/Bam. An Mlul site was added to the 
5* end of PGR primer M.rep/Mlu. A BamHI site was added to the 5* 
end of PGR primer H. rep/Bam. The resulting PGR primers had the 
following base sequences: 

M.rep./Mlu 

GGA TAG CCG TGA CGC GAG GAG GTG CO 
M. rep. /Bam 

GAG CGC TCG TCT CCA TGC TGT AG 

PGR was performed at bases 134 and 2082 of pMV123 . An Mlul 
sited was added to base 2082. Digestion with BamHI and Mlul, 
followed by gel purification resulted in a 1935 base pair DNA 
cassette bounded by Mlul and BamHI cohesive ends with the 
direction of transcription for the pALSOOO ORFl and 0Rr2 genes 
proceeding from Mlul to Bam HI. 

The Kan^, E.rep, and M.rep PGR cassettes were then mixed in 
equimolar concentrations and ligated, and then transformed in E^ 
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coU strain HBlOl for selection of K.n^ transformants. Colonies 
were screened for the presence of plasmids carrying all three 
cassettes after digestion with BamHI * Mlul . Spel and designated 
PMV200. An additional restriction site, Ncol, was eliminated 
from the M.rep cassette by digestion of pMV200 with Ncol, nil in 
with Klenow. and ligation and recircularization. resulting in the 
formation of pMV201. A schematic of the formation of pMV200 from 
PMV123 and pUC19. and of pMV201 from pMV200. ia shown in Figure 
11. Plasmids PMV200 and pMV201 were transformed into 
smeqmatis and BCC. Both plasmids yielded Kan^ transformants 
thus indicating their ability to replicate in mycobacteria. 

A synthetic multiple cloning sequence (MCS) (Figure 12) was 
then designed and synthesized to facilitate versatile molecular 
cloning and manipulations for foreign gene expressions in 
mycobacteria, and for integration into the mycobacterial 
chromosome. The synthetic MCS. shown in Figure 12, contains 16 
restriction sites unique to pMV201 and includes a region carrying 
translation stop codons in each of three reading frames, and a 
transcription terminator derived from E. coll 5S riboacmal RNA 
(Tl). 

To insert the MCS cassette. pMV201 was digested with Narl 
and Nhel, and the resulting fragment was gel purified- The MCS 
was digested with HinPI and Nhel and. the resulting fragment was 
gel purified. The two fragments were then llgated to yield 
pMV204. A schematic of the construction of pHV204 is sho%m in 
Figure 13. 

Plasaid pMV204 was then further manipulated to facilitate 
removal of the M.rep cassette in further constructions, pMV204 
was digested with Mlul, and an Mlul - Not I linker was Inserted 
into the Mlul site between the M.rep and the E.rep to generate 
pMV206. A schematic of the construction of pHV206 from pMV204 is 
shown In Figure 14, and the DNA sequence of pNV206 is given in 
Figure 15. 

S. Cons truction of expression cassette based on BCC HSP60. 
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The HSP61 cassette (Figure 6) was constructed as hereinabov 
described in Example 1. 

Each of PMV206 and the PGR cassette HSP61 was digested with 
Xbal and BaitiHI . The PGR cassette was then inserted between the 
Xbal and BamHI sites of pMV206, then ligated to form plasmid 
pMV261. The construction of this plasmid is shown schematically 
in Figure 17. The reading frame and the restriction sites of th 
multiple cloning site of pMV261 is shown in Figure 16. 

The E. coli lac 2 gene was used as a reporter, or marker 
gene to assay the ability of the HSP61 cassette to express 
heterologous genes in BCG. A BamRI restriction fragment c 
the lac 2 gene was cloned into the Bam HI site of Ban HI dlgeste 
pMV261, resulting in the formation of pMV261/L2. A schematic of 
the construction of pMV261/L2 is shown in Figure 18. The 
formation of pKV261/L2 results in a fusion between the HSP60 anO 
lac_2 genes at the sixth codon of the HSP60 gene and the sixth 
codon of the lac 2 gene. pMV261/L2 was then traneforraed into E^ 
coll. Blue E. coli colonies were selected on x-gal plates for 
the presence of pMV261/L2, thus indicating that the HSP60 
promoter and translation initiation sequences were also active in 
E. coli . 

PMV261/L2 was then transformed into BCG and plated on Dubos 
Oleic Agar plates containing x-gal. All BCG colonies resulting 
from this transformation exhibited blue color, thus indicating 
that the lac 2 gene product (B-galactosidase) was expressed in 
BCG. SDS polyacrylaBlde gel electrophoresis was performed on 
lysates of the pMV261/L2 BCG recobinants, revealing that 
B-galacteeidase protein was expressed to levels in excess of 10% 
of total BCG protein (as determined by staining with Coomassie 
brilliant blue). These data indicated that BCG BSF61 expression 
cassette was functional in expression vector pMV261. 

Plasmid pHV261/L2 was then shown to replicate autonomously, 
and express the E. coli B-galactosidase, or lac2 gene, driven by 
the BCG promoter BSP60, in M. sraegmatis and BCG. 
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T^«»«fer of mYcobacterio ph aqe L5 ^n^>^ ^. tion s.»„>..,. 

to BC G exprggsion vector. 

Plaamid pMH9.4, which includes the ntycobacteriaphage L5 a-tP 
site, and the L5 integrase gene, was employed in providing the L5 
integration sequences to a BCG expression vector. The 
construction of pMH9.4. as well as its integration into 1^ 
smeqmatis and BCG. is described below in sections (i) through 
(vi). 

identification of the DNA ^eouenc,, »f .k , .ttarhn^.n. 

attB. attL. and attR. of M. «in#ig m.i-^ , 

Using standard technologies, a lambda EMBL3 library was 
constructed using chromosomal DNA prepared from mc^Sl (a strain 
of M. smecpnatis which include, an M. ,mea»,i.i, chromosome into 
which has been integrated the genome of mycobacterial phage L%) 
and digested with Baa HI. Phage L5 contains DNA having 
restriction sites identical to those of phage LI (Snapper, et al. 
1988), except that L5 is able to replicate at 42''C and phage LI 
is incapable of such growth. This library was then probed with a 
6.7 kb DNA fragment isolated from the L5 genome that had been 
previously identified as carrying the attP sequence (Snapper, et 
al 19 88). One of the positive clones was plaque purified. DNA 
prepared, and a 1.1 kb Sal I fragment (containing the AttL 
sequence) sub-cloned Into sequencing vector pUC119. The DNA 
sequence oif this fragment was determined using a shotgun approach 
coupled with Sanger sequencing. By isolating and sequencing the 
attL junction site and comparing this to the DNA sequence of L5 
that was available, a region was determined where the two 
sequences aligned but with a specific discontinuity present. The 
discontinuity represents one side of a core sequence, which is 
identical in AttP, attB, and attL. The region containing the 
recombinational crossover point is sho%ni in Figure 19. 

The attL DNA (1.1 kb Sal I fragment) was used as a probe to 
hybridize to a Southern blot of Bam HI digested mc^6 DNA, which 
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is a strain of M. smeqmatin which includes an M. smeam.^s 
chromosome without any phage integration (Jacobs, et al 1987 
hereinabove cited.). A single band of approximately 6.4' kb wa 
detected corresponding to the attB sequence of M. sm^ar..^.. 
This same attL probe was used to screen a cosmid library of mc^ 
(provided by Dr. Bill Jacobs of the Albert Einstein College o 
Medicine of Yeshiva University), and a nuinber of positive cosmi 
clones were identified. DNA was prepared from these clones, an 
a 1.9 kb Sal I fragment (containing tha attB site) tha 
hybridizes to the attL probe was subcloned into pUC119 fo 
sequencing and further analysis. The DNA sequence containing th 
core sequence was determined and is shown in Figure 19. The coi 
sequence, which is identical in attP. attB and attL. has a leng 
of 43bp. 

The rac^ei lambda EMBL3 library was then probed with tS 
1.9kb Sail fragment containing the attB site. Positive plaques 
were identified, DNA was prepared, and analyzed by restriction 
analysis and Southern blots. Lambda clones were identified that 
contained a 3.2kb Bam HI fragment containing the putative attR 
site. The 3.2Jcb Baa HI fragment was purified and cloned into 
pUC119 for sequencing and further analysis. 

ilU Deteraination of a ttP» Integra se region of LS oenome . 

Concurrent with the above procedures, a significant 
portion of the IXiA sequence of L5 had been determined and 
represented in several "contigs" or islands of DNA sequence. 
Sequences of the 6.7)cb Baa HI fragment hereinabove described were 
determined by (a) analysis of the location of Baa HI sites in the 
contigs of the DNA of L5, and (b) by deteraining a short stretch 
of DNA sequence froa around the Bam HI sites of pla.smid pJR-1 
(Figure 24), which carries the 6.7kb Baa HI fragment of LS. 

A segment of DNA sequence ' was located that represented the 
6.7kb Baa HI fragaent of phage LS. Studies of other phages have 
shown that the integrase genes are often located close to the 
attP site. It was thus determined that the LS integrase (int) 
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gene should lie either within the 6.7)cb Bam HI fragment or a 
DNA sequence on either side of it. The DNA sequence in "the 
regions was then analyzed by translating it into all six possiblo 
reading frames and searching these amino acid sequences ^o^ 
similarity to the family of integrase related proteins In^ 
through computer-assisted analysis of the DNA sequence As shown 
in Figure 20, there are shown two domains of reasonably good 
conservation among L5 integrase and other integrases. and three 
ammo acid residues that are absolutely conserved in domain 2 
(See Yagil. et al., J. Mol. Biol., Vol. 207. pg». 695-717 (1989)" 
and Poyart-Salmeron, et al., J. EMBO. . Vol. 8. pgs. 2425-2433 
(1989)). A region was identified, and analysis of the 
corresponding DNA sequence showed a reading frame that could 
encode for a protein of approximately 333 amino acids. These 
observations identified the putative int gene. 

The location of the int gene was not within the 6.7kb Bam .HI 
fragment; however, it was very close to it with one of the Bam HI 
sites (that defines the 6.7kb Bam HI fragment) less than 100 bp 
upstream of the start of the gene. Analysis of the Bam HI sites 
showed that the int gene lay within a 1.9kb Bam HI fragment 
located adjacent to the 6.7)tb Bam HI fragment. This 1.9kb Bam HI 
fragment was cloned by purification of the fragment from a Bam HI 
digest of L5 DNA and cloning into pUC 119, to generate pMHl 
(Figure 25). 

From a combination of the above approaches, a schematic of 
the organization o£ the attP-int region of L5 was constructed 
(Figure 26), and the gene sequence of the attP-int region is 
given in Figure 22. 

(iii) Construction of pMHS . 

The 6.7kb Bam HI fragment of mycobacteriophage L5, which 
contains the attP site, as hereinabove described, was cloned into 
the Bam HI site of pUC 119 (Figure 23). This was achieved by 
purifying the 6.7kb Bam HI fragment from a Bam HI digest of L5 
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DNA separated by agarose gel electrophoresis and ligating wit. 
Bam HI cut pUC 119. DNA was prepared from candidate recombinant 
and characterized by restriction enzyme analysis and ge 
electrophoresis. A recombinant was identified that contained th 
6.7kb Bam HI fragment of L5 cloned into pUC 119. This plasmi^^ 
was named pJR-1, as shown in Figure 24. 

Analysis of DNA sequence data from a project to sequence L5 
showed that a 1.9kb Bam HI fragment adjacent to the 6.7kb Bam HI 
fragment hereinabove described contained the integrase gene. 

A plasmid containing a 1.9kb Bam HI fragment containing the 
DNA encoding for the integrase cloned into the Bam HI site of pUC 
119 was constructed. The 1.9kb fragment was purified from a Bam 
HI digest of L5 DNA and cloned into the Bam HI site of pUC 119 
Construction of the recombinant was determined by restricti.on 
analysis and gel. electrophoresis. This plasmid was called pMlfl, 
the construction of which is shown schematically in Figure 2S. 

pJR-1 was then modified by digestion with EcoPI and SnaBI 
(both are uniqu« cloning sites), between which is a Bam HI site. 
The EcoRI -SnaBI fragment, including the Ban HI site was excised, 
and the. plasmid was religated to form plasmid of pMH2. which 
contains on Bam HI site compared to two Bam HI sites contained in 
pJR-1. A schematic of the construction of pKH2 is shown in 
Figure 26. 

The 1.9kb Bam HI fragment, which includes the integrase 
gene, was purified from a Bam HI digest of pKHl and ligated to 
Bam HI digested pKH2. Recombinants were identified as above and 
the orientation of the 1.9kb fragment determined. A plasmid 
called pMB4 was thus constructed (Figure 27) in which the region 
from the Sna BI site (upstream of attP) through to the Bam HI 
site (downstream of the integrase gene) was identical to that in 
L5. 

pMH4 was digested with Hindi I I (unique site) and was ligated 
to a Ikb Hindi I I fragment purified from pKD43 (supplied by Keith 
Derbyshire of the Nigel Cindley Laboratory) that contains the 
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gene determir. .g resistance to kanamycxn. Racoi^inant^ 
identified and characterized as above. This plasmid Calll^ 
pMH5. A schematic of the construction of pMH5 is shown in Fi^r- 
28. 

(iv) Integration of pMHS into ati^R of m. sm.>rp .^^. 

Plasmids PYUB12 (a gift from Dr. Bill Jacobs, a schematic of 
the formation of which is shown in Figure 1). pMDOl (Figure 29) 
and pMHS were electropo rated, with four different concentrations 
of^plasmid DMA over a l^OOO-fold range, into M. smeomati. strain 
mc 155, a strain which is able to support plasmid replication 
In sections (iv) through (vi), all electroporation procedures of 
M. smeomatia. or of BCC, were carried out as follows: 

Cultures of organism were grown in Middlebrook 7H9 media, as 
described by Snapper, et al. (1988). harvested by centrifugation. 
washed three times with cold 10% glycerol, and reauspended at 
approximately a 100 x concentration of cells. 

1 Ul of DMA was added to 100 ul of cells in an ice-cold 
cuvette and pulsed in a Bio-Rad Gene Pulser. and given a singi^e 
pulse at 1.25 kv at 25 nF. 1 ml of broth was added the cells 
incubated for 1 hr. at 37»C for expression of the antibiotic- 
resistant marker. Cells were then concentrated and plated out on 
Middlebrook or tryptlc soy media containing 15 pg/ml kanamycin. 
Colonies were observed after 3 to 5 days incubation at 37 "C. 

Each of pYUB12, pMDOl. and pMH5 carries kanamycin 

resistance. Plasmid pYUB12 carries an origin of DNA replication, 

while pMDOl lacks a mycobacterial origin of replication. Plasmid 

pMHS does not carry a mycobacterial origin of replication, but 

carries a 2kb region of phage LS which contains the attP site and 

the integrase gene (Figure 22). The number of transformants were 

linear with DNA concentration. Plasmid pYUB12 gives a large 

number of transformants (2 x 10^ per ug DNA) in mc^lSS, while 

4 

pMH5 gives 6 x 10 transformants per.pg DNA, and pMDOl gives no 
trans f ormants . 
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The above experiment was then repeated by electropcratin 
the plasmids pyUB12. pMDOl. and pMH5 into M. smeom^ri. strai 



mc^6. 



which does not 



support plasmid replication, 



N 



transformants in mc 6 were obtained from pYUB12 or pMDOl. whil 
pMH5 gave approximately 10* kanamycin resistant transformants \ 
mc^6 per ug of DNA, thus indicating integration of pMH5 into th 
mc 6 chromosome. 

DNA from^six independent pMHS transformants (four in mc^ 15 
and two^in mc^S) was prepared. These DNA's (along with DNA fro 
both mc 155 itself, and mc2l55 carrying the plasmid pYUB12) wer 
digested with a restriction enzyme, and analyzed by Southern bio 
and hybridization with the M. smeomatln l.gkb 
described. 



attB prob 
30, all si 



hereinabove described. As shown in Figure 
transformants have integrated into the attB site, resulting i 
the production of two new DNA fragments with differen 
mobilities. If pMH5 did not integrate into the attB site, i 
would be expected that a single band, corresponding to the attB 
site in the mc^l55 control, would be obtained. 
(V) Construction of dMH9.2 and pWHg.4 

PUC119 was digested with Hindlll, and a Ikb Hindi I I 
fragment, containing a kanamycin resistance gene, purified from 
pKD43, was ligated to the Hindi I I digested pUC119 to form pMH8 
(Figure 31). A 2kb Sail fragment (bp 3226-5310), which carries 
the attP and integrase gene from Sail digested pMB5, was purified 
and inserted in both orientations relative to the vector backbone 
of Sail digested pMH8 to form plasmids pMR9.2 and pMH9.4 (Figures 
32 and 33). 

2 

M. — oaatls strain mc 155 cells carrying, as a result of 
electroporatlon, plasraid pYUB12, pNR9.2 or pMH9.4, or strain rac^6 
cells carrying plasmid pMH5, as a result of electroporation as 
hereinabove described, were grown to saturation in broth wit)) 
kananmycin. Cultures were then diluted 1:100 into broth without 
kanamycin and gro%m to saturation. Two further cycles of 
dilution and growth were done, corresponding to about 20 
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generations of bacterial growth. Cultures were plated 
single colonies on non-selective plates, and approximately lOO of 
these colonies were patch plated onto both non-selective and 
selective plates. The % of colonies that were sensitive to 
kanamycin. thus corresponding to the percentage of cells which 
lost the plasmid. is given below in Table I. 

Table I 

loss 

pyUB12 (mc^lSS) 35 

pMH5 (mc^6) 17 

PMH9.2 (rac^lSS) 3 

PMH9.4 (mc^lSS) 0 
<vi) Tr anafomation of BCC with phm9.4 

The 1.9 kb Sal I fragment, which includes the M. smetimati^ 
attB site as hereinabove described was cloned into pUC119, a^Id 
the plasmid generated was naned p^-12. (Figure 34). 

Gel purified Sal I 1.9kb M. emeonatia fragment containing 
attB (isolated from pMH-12) was used to probe a Southern transfer 
of Bam HI digested mycobacterial DHA'a, including BCC substrain 
Pasteur, shown in Figure 35. This demonstrated that there is one 
Bam HI fragment of BCC that strongly hybridizes to the 
smeqraatis attB probe and three hybridize weakly. The strongest 
hybridizing band Is the fastest moving band (approximately 1.9 
kb). 

The same probe as above was used to probe a BCC cosmid 
library (provided by Dr. Bill Jacobs) and positive clones were 
identified. DNA was prepared from several positive clones and 
analyzed by restriction analysis and Southern blotting. The 1.9 
kb Bam HI fragment (corresponding to the strongly hybridizing 
band in the Southern blot was identified, gel purified from the 
cosmid DNA and cloned Into pUC119. The resulting plasmid was 
named pMH-15. (Figure 36). 

Plasmid pMH-5 and pMH9.4 were electroporated into BCC 
Pasteur. It was observed that pMH9.4 transforms BCC with high 
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efficiency (approximately 10* transformants/ug DNA), while pMK-^ 
transforms BCC at low efficiency (l-io transformants/ug DNA)' 
DNA was prepared from BCC transformants and analyzed by Bam HI 
restriction and Southern blot analysis, probing with gel purified 
1.9kb Bam HI BCC attB fragment from pMH-15. These data are shown 
in Figure 37 and show that integration of both pMH5 and pMH9.4 is 
specific to the BCC attB site (ie. the strongly cross-hybridiring 
fragment in BCC). This is illustrated by the loss of the 1.9kb 
Bam HI fragment from the transformants and the appearance of two 
new bands representing attL and attR junction fragments. Figure 
37 shows just one of the pMH5/BCC transformants, although all of 
the four that were analyzed show that one of the bands (the 
largest) is smaller than expected (and different in each of the 
transformants), indicating that the transformation effiency of 
pMH-5 is low in BCC. In contrast, the four pMH9.4 transformants 
are identical to «ach other (Figure 37) and give attR and attL 
junction fragments of the predicted aizea. 

Plasmid pMR9.4, which includes the mycobacterial phage L5 
attP site and the L5 integraae gene, was digested to completion 
with either Kpnl ♦ PvuII or Xbal ♦ PvuII, and a restriction 
fragment of 1862 or 1847 base pairs, respectively, each of which 
contain the attP site and the integrase gene, were purified by. 
agarose gel electrophoresis. Plasmid pMV261/LZ was digested with 
Xbal or Dral to generate either a 7569 bp or 7574 bp vector 
fragment. The 7569 bp fragment was llgated to the 1862 bp 
fragment derived from pHR9.4 to form pMV460/LZ. The 7574 bp 
fragment was llgated to the 1847 bp fragment derived from pKR9.4 
to form PIIV460 R/LZ. Plasraids pMV460 F/LZ and pMV460R/LZ each 
include a mycobacterial replicon, the L5 attP site, and the L5 
integrase gene. A schematic of the formation of plasmlds pMV460 
F/LZ and pNV460R/LZ' is shown in Figure 38. To generate 
derivatives without the mycobacterial plasmid replicon, plasmlds 
pMV460/LZ and pMV460/LZ were digested with NotI and 
recircularized by ligation to generate pMV360F/LZ and pMV360R/LZ. 
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A schematic of the construction of pMV360F/L2 and pMV360R/L2 
shown in Figure 39. * 

Plasmids PMH9.4. pMV261/L2, pMV460/L2. pMV460/L2. pMV460/L2 
and PMV460/L2 were then transformed into M. sn,e«n... , ^^d BCC to 
test their ability to replicate autonomously or integrate into 
the M. smeqmatis or the BCC chromosome. Transformation wi^h 
PMH9.4, PMV261/L2. pMV360F/L2, and pMV360R/L2 yielded kananmyc-^n 
resistant transf ormants of M. smegmatl. and BCC. Transforman^ 
of PMV261L2, PMV360F/L2, and pMV360R/L2 were shown to express r 
coli B-galacto.id.se by SDS-poly.cryl.mide gel electrophoresli 
and X-gal assay. Pl.smids pMV460F/L2 and pMV460R/L2 failed to 
yield kanamycin resistant transf ormants. thus indicating that 
Chromosomal integration of a plasmid carrying sequences mediating 
autonomous replication is lethal to mycobacteria. 

Ij Construction of oMV307. 

Pl.smid PMV206 w.s digested with NotI to remove the 
mycobacteri.l repllcon. The resulting 2209 bp fragment, which 
includes the .ph (K.n^) gene, the E. coll replicon and the 
multiple cloning site, was ligated and reclrcul.rired to form 
PMV205. the construction of which is schem.tic.lly depicted in 
Figure 14. 

PCR with primers Xb.I-Att/Int .nd Nhel-Att/Int w.s then 
performed on . S.l I fr.gment from pMH9.4. which cont.ins the 
attP site mnd the L5 lntegr.se gene. The resulting c.ssette w.s 
then digested with Xb.I .nd Nhel .nd . 1789 bp fr.gment w.s gel 
purified. PMV205 v.a then digested with Nhel, mnd the resulting 
fragment w.a lig.ted to the 1989 bp fr.gment obt.ined from pMH9.4 
to form PMV307. A schem.tic of the construction of pMV307 is 
shown in Figure 40. 

8. Construc tion of pMV261/HIVl-crp 120 . 

An Sm.I-Cl.I utigen gene fr.gment, or c.ssette, w.s 
constructed by PCR. Mid cloned between the B«n HI and Clal 
restriction sites of pMV261 to form pMV261/HIVl-gp 120. 
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Plasmid pMV261/HIVl-gp 120 was transformed into BCG, and the 
presence of the corresponding antigen in BCG was verified by the 
appearance of immunoreactive protein bands in Western blot 
analysis of BCG recombinant lysates. 
9, Construction of pMV361/HIVl'>a p 12Q 

The HIVl-gp 120 antigen gene expression cassette, which 
includes a promoter sequence and an HIVl-gp 120 gene sequence, 
was excised from the pMV261 derivatives with Not I and a second 
restriction enzyme site (Pvu II, Eco RI, Sal I, Cla I or Hind 
III) and cloned into the integrating plasmid pMV307 between the 
Not I site and a second enzyme site (Pvu II, Eco RI, Sal I, Cla I 
or Hind III) to form the plasmid pMV361/HIV-Igpl20 • The backbone 
of this plasmid is shown in Figure 41. 

Plasmid pMV361/HIV-Igpl20 was transformed into BCG and shown 
to express the corresponding antigens by Western blot analysis 
(Figure 42) with the appropriate antigen-specific human sera. 

Example 4 

Cytotoxic T lymphocyte responses to HlV-l-gp 12 0 

HIV-1 gp 120 was expressed in BCG as a six amino acid fusion 
protein with BCG hsp 60 protein using vector pMV361/HIV-l-gpl20, 
using the hsp 60 promoter to control expression. 

Two groups of mice were inoculated with 1 X 10^ CFU's of 
recombinant BCG expressing the gp 120 gene from the integrative 
plasmid pMV361/HIV-l-gpl20. One group received the BCG via 
intraperi tonal injection (100 ul) whereas the other group 
received the BCG by deposition of the dose (10 ul) rtibbed into a 
tail scratch. 

CTL activity was measured at various times after 
immunization. CTL activity was measured as follows: 

Two mice from each group were sacrificed at various times 
after immunation, and the spleens were removed. Single cell 
suspensions were made and the red blood cells were lyzed with 
ammonium chloride. The cells were stimulated in vitro for 5 days 
with P81S cells, that were pulsed with peptide PIB, a fifteen 
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residue synthetic peptide within HIVl-gp 120. A 4-hour chromium 
(Cr ) release assay was then carried out using untreated P815 
and peptide P18 pulsed P815 cells as targets. Significant 
Pia-specific CTL activity was observed in the mice immunized by 
taxi scratch 14 weeks after immunization. At 16 weeks CTL 
activity was observed in both groups of mice. Upon repeat of 
this experiement, CTL activity was observed at time points as 
early as 8 weeks after immunization. 

Example 5 

Recombinant BCG transformed with pMV361/HIVl-gp 120 were 
grown to mid-log phase in Dubos media and concentrated by 
centrifugation. The bacteria were then resuspended in 15% 
glycerol and frozen using a rate contolled freezing apparatus' 
The bacteria were stored at -TO^C until use (referred to as 
"vaccine"). A second preparation grown in the same way was not 
frozen and is referred to as a "fresh" preparation. Prior to 
immunization of animals, the bacteria were resuspended in PBS ♦ 
0.05% Tween 80 to the desired concentration and cup sonicated 
briefly to disperse clumped bacteria. Six week old BALB/c mice 
were inoculated with a single dose of 5 x 10* cfu fresh bacteria 
(determined post inoculation) or 1.5 x 10^ frozen bacteria 
(determined pre-inoculation) by tail scratch (t.s.) injection. 
At 8 weeks post-immunization, splenocytes were harvested from 
animals and CTL activity was measured (described below). 
Splenocytes from unimmunized animals were used as controls in the 
CTL assays. 

CTL activity was determined as follows: 

Splenocytes (ACK- treated, 5 X were stimulated in 

vitro in 10 ml in upright T25 flasks by co-culture for 5 days 
with mitomycin C-treated P81S cells (5 X lO^ml) that were pulsed 
with 250 ug/ml of peptide P18 for one hour. A 4 hr. ^^Cr release 
assay was subsequently performed in triplicate using P815 targets 
with or without pulsing for 1 hour with 250 ug/ml peptide P18. 
Various effector-target ratios were tested using 5000 
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targets/well. Specific lysis was calculated as follows- y 
specific lysis = 100 X (release by effector cells minus' 
spontaneous release/maximal release minus spontaneious release). 
The results are given in Figure 42. 

As shown in Figure 43, both groups of mice showed an 
increased CTL response at 8 weeks after immunization as compared 
with unimmunized mice. 

Example 

Recombinant BCG transformed with pMV361/HIVl-gp 120 were 
grown to mid-log phase in Dubos media and concentrated by 
centrifugation. The bacteria were then resuspended in 15% 
glycerol and frozen using a rate contolled freezing apparatus 
The bacteria were stored at -70«>C until use (referred to as 
vaccine"). A second preparation grown in the same way was not 
frozen and is referred to as a "fresh" preparation. Prior to 
inununization of animals, the bacteria were resuspended in PBS ♦ 
0.05% Tween 80 to the desired concentration and cup sonicated 
briefly to disperse clumped bacteria. Six week old BALB/c mice 
were Inoculated with a single dose of 5 x 10^ cfu fresh bacteria 
(determined post inoculation) or 1.5 x 10^ frozen bacteria 
(determined pre-inoculation) by tail scratch (t.s.) injection 
At 8 weeks post-immunization, splenocytes were harvested from 
animals and CTL activity was measured (described below). 
CTL activity was determined as follows: 

Lymph node cells (5 x loVml) were stimulated in vitro in 10 
ml in upright T25 flasks by co-culture for 5 days with mitomycin 
C-treated P815 cells (5 X loVml) that were pulsed with 250 ug/mi 
of peptide P18 for one hour. A 4 hr ^Ic^ release assay was 
subsequently performed in triplicate using P815 (matched ) or EL4 
(mismatched) targets with or without pulsing for 1 hour with 250 
ug/ml peptide P18. Various effector target ratios were, tested 
using 5000 targets/well. Specific lysis was calculated as 
follows: % specific lysis = 100 x (release by effector cells 
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minus spontaneous release/maximal rel^««. 

release]. release minus spontaneous 

AS snown m Figure 44 ^ 
ly».ph node cells was degenerated HlT"" """" ^ "° 
Wit. SCO .„n3.o™ed p jr^r;;'?,.,^™'""""^- - 

preset: lU:.,:: r"n:r;;°r!-. - — 

embodiments described above Th-^ • '"^^^"^ specific 
ct.e. .Han as pa.Ucularl. ae,"UTd"?«:r.r„.r 
scope of the accon,panying claim,. " '"e 
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WHAT IS CLAIMEU IS: 

1. A method of inducing a CTL response in an anima^ 
comprising: 

administering to an animal mycobacteria transformed I 
with at least one DNA sequence which encodes a protein or peptide ^ 
or fragment or derivative thereof which includes an epitope which 
is recognized by cytotoxic T lymphocytes, said mycobacteria being 
administered in an amount effective to induce a CTL response in 
an animal. 

2. The method of Claim 1 wherein said protein or peptide 
or fragment or derivative thereof includes an epitope which is 
recognized by cytotoxic T lymphocytes induced by an HIV protein 
or fragment or derivative thereof. 

3. The method of Claim 2 wherein said protein or peptide 
or fragment or derivative thereof is an HIV protein or fragment 1 
or derivative thereof. 

4. The method of Claim 1 wherein the mycobacteria are of 
the species M.bovi8 «BCC. 

5. A composition for inducing a CTL response in an animil, 

comprising: 

mycobacteria transformed with at least one DNA sequence 
which encodes a protein or peptide or fragment or derivative 
thereof which includes an epitope which is reocognlzed by 
cytotoxic T lymphocytes, and an acceptable pharmaceutical 
carrier, said mycobacteria being present in an amount effective 
to induce a CTL response in an animal. 

6. The composition of Claim 5 wherein said protein or 
peptide or fragment or derivative thereof includes an epitope 
which is recognized by cytotoxic T lymphocytes induced by an HIV 
protein or fragment or derivative thereof. 

7. The composition of Claim 6 wherein said protein or 
peptide or fragment or derivative thereof is an HIV protein or 
fragment or derivative thereof. 
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8. The composition of Claim 5 uh«*.-<- 

r wi«iin 3 wnerexn said mycobacteria 
are of the species M.bovis -BCC. 
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CI a I 7298 
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BamH I 7264 
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Cla I 6790 
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423 Sal I 
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2986 EcoR I 
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1 
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424 Sal I 
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•Nhel7l96 319 Nor I- 

•HinD III 6698. \ J L^^^^ Sol I- 




no. ^ 



2759 Ncol- 
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6407 184 Nor I- 



•X HinD III 5806 
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423 Bal |. 




1307 Bol |. 



•AlwN I 4453 
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•Spl I 3217 
•Nha I 6407 '?|4%",.'* 



1967 NcQ t. 

2193 EcoR I- 
2431 Bal II- 



2511 Sal 



I 



423 Bal I- 



•Aatll49I5 



AlwN 14453 




I 



PGR mutagenesis of Cla 5378 
end Hindlll 5806 sites. 
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I 

GCTAGCnCTATATGCTITGATXXIAAATTltTATTXXX^ 

1 ♦ ♦ ♦ ♦ + . 

aUTOGOGATATAOGCAACrAOTITAAAGATACXXXnrXXXXACwlGCC^^ 

Bern -M. rep . H^m^a,^ 

■•- I — oa 

CGAGa:ACTATCGAcrACGOGATCATXXXx;Aoc:A r^ofTYrrnr^ T.KTT rrt 
101 * + ♦ + ^ w 

a,iUA;iUATACCKUTCCccTAGTArrr^it><;iuK x xx:A(y;ACAC^ 

TlGlClGCULTOCXXXXJbl llXJb-ICC3CGGTXX:AlUGAGCOfiGGCCACCltX;A ^ 

AAC\G ACCCAGGGGCGCAACGCAGCXXXACCTACCItXXyx:COCnXX;A^ iP 

ATTOCUCOCAATCAATKnTCXXXUGAACItnXUATGOGOC^ > 
301---- — > 4. + 4... ii 

TAACCJUX;iTAGTrAACAACCantnTGACACTrAOCObl 1 IXASll XXXiAAC 5| 

401 * + ♦ 52 

OCTAGAGO0CCnXXXAOCCACGACaX3TCCCCACCXXrrACTA<^ 

GAATCAATCAaXUTACXXXACOGAACCTreAAGCCACTCCIUCi UIAAAAC 

501 ♦ ♦ 

CTTACTrACrrOCCTATOOO C ' lUiCUlU CACl i Ct^^^ 

TAAACrrCTXXIAAAOGOGGAACTCAGCXXaXTXXIACrATTATCTnCO^ 
601- - ♦ ♦ ♦ 

ATTrcAGACCITItXXX3CTICAGTOGCCGGA(XTtX7rAATA^ 

(Q 

TAACGAAGOCCIXXXrATTGACCXTGAGTCAl J 11 IClC'llX/llXXXXXXXIAT ^ 

701 ♦ ♦ ♦ 

ATItXnrCOCGAaXnVUCinGGACnZACTAAAAAGAGAOCAGGGa^ ^J 

TIXL^TCATGiCTAACCCGTATCX7roACCAnXlLlUl.^lllt ^ - 

801 * ♦ ♦ ^ 

AACTA(n\\CTCATTXX;GCATAGCACTCX7rAGCAGAC:ACX:AAAGTAGCCATA -p 

aiAAACAGGAAAAAAOCXXXXTTAACATGCOCCGCTrrATCAGAAG £ 

901 - ...... — 

OLl i lUlLUAi liTllXXXXXIAATroTACXXXXXXIAAAYAGTCr ^ 



AGACATtntTTCAATOKnTCACGACCAaxnGATCAGCrrrACrGCAG^ 
1001.. ....^ ^ ^ ^. - 

'lC]uiAGACACrrACCGAAGlXXnXXriXXX;ACTACTCXAAATXXX.t;ii;iH ; H 
CACTCAACCItXSlAGCGTGrcGTlXXXXSAGOCATCTAGCAAO^ < 

1101 ■»• * + * ^ 2 

GIGAGTTCGAACTKX^CACACrAACGCXntXXnAGATCGT^ 
GAACIXMCrcCTATXXTItXnVlAACXnAGTrrcACCAGCAT^^ 

12P1 ♦ ♦ ♦ ♦ 

CTCX^iCTGAGCATAGCAACXIATnXX^^TCAAACIXXTITXTr^ 

U&xu I T ~ ~ ■ 

ACGAGAGTCG0CACCGATTXXAOCACAACCACTArAACXr.ACTTTX^ 

1301 - - - + + ♦ ^ 

TGCTXnt L^OCXKr rCCCrACCCrCCIX;] ILVl U ATCTrGCCrCAAGCGCTCC 
CGAAATtXXnTCXn*ATtX;A(X:AAGATIXX7rAGAACCOCGTXnrC 

1401 - ♦ + ♦ 

CCnTACCCAACCATAGC 1 IX; 1 ' lUlAAGCAlXnTCGGGCAGAGCAGACXXJA 
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fig: SB 

^CTKTTCCGACCCCnrCCOCXXXXX^ 

nACAr^.cirrfxr.AAACcxxxxxxrxx7it^^^ 



100 



OmCC^ACCCAT^^ 

ACATCCaXXnCOtTTAOCiJi^^ 200 

OTAcmccmxMiiioJ^ii^ 3oo 
oncrro^AiiOTiiii^iii^ii^^ 

ACACCAACIXXntXXXXrATtrCAOCGOT^ " * 



ITOGT 

CTCTXXXUantUCCAACAACATCAATCX^ i IIJ L GI U HIU; 

AGACGCTtX:A(nXXnTinTtnACTrACxicAA<^^ ^ 

t ^f^^ypJ^^J'pfrccTxxxTAoccnnx^ 

^ ^^ACarnXTACCAOUOCGATCCG^ 

X 'CCATAOCGCCACmnTTACOnCACAACCITCXL^ 

H Wn-ATCGCOGTTUACAAATCXXUGTCT^^ ^ 

2 A^CCCaiATCAAOiGAAATrcCOCCmCAaXSACaiAT^ 

(nAATr^,OOClA LUGlLIITA AOTO ^ 
<^TT>UOCCTI^^ 

GTAATItXX;AACUantnTItUCTroCTO^ 
GAGGACAGTOXAOCUO^^ 

^^rccitncAGccix^TCcn^ "«> 

CCAACOCACTOACACmXXXXXXIAC^Tjre^^ 



TACATCACC4CAACCAaXM.TItnraXXnT:ACClCC^ 
CrrATIt:AAAACCGAaXX*aUAACACCCAAOCACACACGCATr.GOT 

ccATAAGi 1 1 iiAx;itxxriTG(nTnrro^^ 
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AAAOCAGAAAACTAOXnCTAOCjitxUCITmCCnnca^ 
13P1 ♦ ♦ + + 

1 1 iU>lLl J'JiUATCCCAGAICinanVUAAATGGACACGaiXXXXAa^ 

GAACAcocGTCCATTCTCCGc'i ici;i'iGiuxxxmTGAoc.u ; criLvn; 
leoi ♦ ♦ ♦ ^ ^. 

CITTTItXXXIACXTAACAGOCG/UGCAACAtXXrGAAAACn^ 
e F L P P N D A E NEAKQAAEQ °° 

T0CAGATCCAGCCXXUAA1I.1 i lUXJLUU liUXXXXXUUGlUGOCCrOGT 2 
17P1-- - * ♦ ♦ ♦ 

ACCIXTACCItXXXXnTrACAAAOOCCCAAAaailAAVriClC A CXXXX^ 
e LELCFMKATQPWSEGKD !£ 

oocAAcraxixxxnTcxntxxxxAOCAGccGr^cGAaTrc^ u. 

1801 -»• ♦ ♦ ♦ ^. _ 

CGGTG\OCG\CGCAAGGACGCOC7TXXn^XXXnx^CTGCAO^ ^ 
e NESREQAVLRVVIIRESL g 

GACAGTCSXXJCOOCXinCTAGCOCnCCC^ 
1901 — + ♦ + I 

CTXTICAOCOCUCGCXXAACATCOCCACaUCATCGGCAGaiA^ 

« V T PCRWYCDSYCDSYGDS 2 

2DD1 * ♦ + ^ . . 

CACACGAOCXX;aXX:AaXOCGAOC^OCXrOCAAGGOGCICTAO^^ 
d ISAATRAAAGKR SIASQ 
e HEGGWASSRGEALIIGVP 

OOGCGACl ilA/l Ji;ilMTCCAACXXX:AAATti <;iO ' riU CCCATCXXXXXX;A^ 
2101 ♦ ♦ ♦ ♦ 4.. 

OGCCCTCAACC^ACACrAGCnTtKXXnTTACCAOuaXXTAOCCO^^ 
dRSXTTIWRWISWAlAR V 
e A V Q NEDLA LEQQRERPG 

ccciTitxajcaTitxxiAcraxxATAGATTxxyyxx^^ 

2201 ♦ ♦ ♦ ♦ ^ 

CXX:AAAG0G<XX:ACCGTCAGCCC7rATCTAr^WXrr.G^ ^ 

^ N RARCEAYIARGLGDV % 

cRKARPVRCLDRPRTRGR j 

ACCIXLiaX}AATCXl\ACAGTtXXCAATIXXXXXXTACCGGOG^ 0 

TWSACrcCCITAGCTnTrcAaXXTrAACCaXXJATCCC^^ U) 
d RVSDFLACNRGLPTPAA 
e QRPRVTRLEARAADSGG ^ 

CAGCTtXXXXnXXATGTlXXXnXliGTGICTA^CAT^ ^ 
3101 * + ♦ ^ — 

CTXXL\GaXGAGCrACACXX;ACTCACACATCTCTACACnt>ant^^ L 
dLE AEIESLTYLDEIILWET 
e ARGREPQTELSRLPA MC 
S^,l^ C-TPCR MUTAGENOSISpHVII0-300s 

cixxxxnxxxxxnaiAiOTxocxxxsAAGGccrnxxxxx^ 

25D1 + + ♦ + ^. 

GACGCCAGCGCCAOtnr,CCGCGCGGCTItXGGAAGCrGa7Tt^^ 

d RI>GDVARRL CEAC A AMT 
cQPRRRRRASP RRRV 

TGACntXXXL\CACTCCX71X7ir.CGir^.CCCnt-^;«X!GATlt^^ 

2fi01 ♦ ♦ ♦ ♦ ^. 

ACTCACCGCltntl^CCCACACCCAaDCXXlAACaxrOCTAACGGGTCX^ O 
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^^cl^k.\xcACcclL^l^^.^L^.u.tuTAcax^c^ctx;lTAAax7ro 

AAGACCGCXTItXXlAGAAAGGAOCOCXXnATOCXXnTUGC^ 
KAAREKKAKYGLKKVTD 

anccTtUTAOwxxwuTtxxnrcGcccant^ 

< CCAGCACrATCaXXXXTACGC\A(»aXXxi[a^^ 
^ DEVARIRERRAAQEAl. 
AGIOCCGTGATItX^AGaXXTItXXXXXXXXntAa^ 

- CACXXXACTAACCTOCXXXlAAGOOCXXXOCAGTtXXxiG^^ 

GTIRAGEAATVRRKKR 

^_ CTCATAGCAATtxrrcc4iiKxntL^a;ccGAcrntxxx^^ 

^ <^TATanTACC(UCCTAaDCACrcCC(^^ ^ 

'LLASMASASKARRAV 
X MATCCESVRVKRAACS 

tXXXXXLi<nXi<GTX7nXXXna7rAGAOCACGATXXXX7^^ 

< 



+ 2KD 



CGOrCIXUCTCACAOOWUGCATCTOntXTAGGGCAGGOCGGTm 
RAS LTAEYVVI G DAWI 

ACQTEGRLGRDRGG LE 

G-APCR MUTAGENOSIS k o o l e 

VilO~300S IRocE 

ClCtAJlVllXXXJTACCCGTCCXXXiAOiCA a,lLXin 

GACaiACACGCXIATDKXIAGCCCLiUlO-lt X IAGCAACGlUJ U^ ^ ^ 

ESDPLPCPCVDNCPFEA 
RQ G T A T RS V R R QV P I R 

TirXXXXTIOXXIACCrAGATCaX^TtMCGGCCOGACKATAGGCC^ 

CAAGGCCCACCanrXZATCTAGGCGTACTCCO^ ^ 
XRTP LY IRMLA PRYAWL 
DQ E P P A P L DAEPRSSLGVv 

^< 1 luiALi iu;iuiu;iCAaxx:4oaxx3coGcioocA ibi nju c m: 

0 CAAACATCXIACCAC^aiACTOCGGITXXxia^^ ^ 

l[) KYTTQQRWRPPPMWAG 
QVEDASALAATARERR 

Q GTrnxXXUGGCCATCnGGOCCGOCnTriTCCIC^ 

r <:AAAGGCixrccrA(:AcaxxraiAUAc^ 

IL EWAIII GPMKTML CS Y S 

NGLRWP RP KQDEPR L L 

ATACCCTtXKXCAATWXXX3CCGCATAAGCCACC^ 

ALPKRRAYETRPVP a IV 

— ♦ ♦ ♦ 

GCAACCccnccocTACOCionarjrmTxriAr^ . ^ zwo 
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Fi o. y 



BcimHT BcxhoHI 




3648 Sol II' 



IIIZEcoRV 

824Cla.I 3079BcLmHl 
IB am HI 3005EcoRT 



1 1 



Bo-mHI lacZ CASSETTE 
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Fl O. 9 



wne I 6407 ^or |. 



423 Mac I- 



•Aot II 4915 



AlwN I 4453 

•Nde t 3864 

•Spl I 3217 

Diqett with Norland Ball, 
nil in,Ugal«/R«circulori2« 




1307 Bal I- 



Mw I l46BomH I 

•Nhe I 5285 |84 Nor I- 




845Ncol- 
1071 EcoR |. 



1309 Bgl II 
1369 Sol J 

Aft II 4271 



1967 Ncol- 

2431 Bol II- 
Sori- 



Digest withNdal end Spl I 
Fill in, Ligoto /Reeirculo r ize 



Mh. I BomHI- 
•Nhe I 5763 134 Nor I- 

,423 Mac!- 



AlwN I 3808 
2096 Spl I- 



•Nde I 2742 




1307 Bol I- 



Digest with BomHiaEcoRI 
ond Isolate 1071 bp fragment 



Aat II 3149 



-AlwN 1 2667 



'^•'"^ f-|967Nco|. 

2193 EcoR I 
2431 Bgl II 
J 2511 Sal I 

— ^Ligate-*-Dige«t with BamHI a EcoRI 
i and bolof« 3570 bp fragment 

Nhe I 4641 W6 BomH I- 
Nne I 4641 ,Q4^j.,_ 



845 Nco I- 

1071 EcoR I- 

1309 Bgl II 
1389 Sol I. 
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61 



-j / 



Nhe i 4841 



l46BamH I 
184 Nari 



-Aat II 3149 



PCR mutagenesis 
ofEcoRI 1071 



Nhe I 4841 '^^ BamHI 




Fl O. lO 



845Ncoi- 
1071 EcoR I 

1309 Bgl II 
1389 Sal i- 



•AlwNl 2667 
•ApaL I 2590 




PCR mutogenesis 
of Sail 1369 



i46BamH I 
184 Nor I; 

Nhe I 4841 



845Ncol 



1309 Bgl II 
1369 Sail- 

Aat II 3149 



AiwN I 2687 
•ApoL I 2590 



Digest with ApoLI* Bglll. 
Isolate 3360 bp fragment 





•AlwN I 2687 
•ApaL I 2590 



Ligate 

•Nhe I 484! '46 BamHI- 
Nhel4S4l 184 Nor I- 



Digest with ApaLI-*- Bgl II. 
Isolate 1281 bp fragment 



Aat II 3149 



AlwN I 2687 
•ApaL I 2590 




845 Nco i- 



1309 Bgl I/- 
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Bel I 4101 
^ Sfll 4084 
Spl I 4002 
Hpol 3993 
;Sal I 3984 
•Clo I 3976 
•EcoR I 3966 
•Pst I 3960 
^•Pvull3957 
•BamH I 3952 
•Nco I 3948 
•Xbo I 3938 

•Kpn I 3922 
•Not I 3914^ ■ 



Fl 



/ 

O 



1^ 



207 Nru I- 




•Bel I 4115 
Spi I 4016 
•Hpa I 4007 
Sail 3996 
•Clo I 3992 
•HinO III 3986 
•EcoR I 3979 
•P»t I 3974 
•Pvu II 3971 
•BomH I 3966 
•Nco I 3962 
•Xbo I 3953 
•Dro I 3947 
•Bglll 3942 
•Kpn I 3936 i mk- i 
•Not I 3928 «Nhel 



^ Bel I 2044 2011 Mlul. ^ 
/.Spl I 2105 

•"«Pf» ' 2096 Digest Mlul Bgl II 2797 

•HinD III 2075 
EcoR I 2068 
•Pst I 2063 
•PvjJ II 2060 
•BomH 1 2055 
•Nco I 2051 
•Xbo I 2042 
•Oral 2036 
Bgl II 2031 
•Kpnl2025 
•Not 1 2017 
•Mlul 2011 



Digest Not I 
Ligate/Recirculorize 



INhel 





2011 Mlu I- 
2017 Not I • 
2025 Mlu |. 
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IT J H / 

TCCCTGCTGCCAGrCGCGATAAi^lUJliJlVl lACOCGCTrcCACTCAA GAC 

1501 + + ♦ + + . 

ACXXUCGACXXTrcAOCCCTATTCAGCACAGAATGGCOCAAC^^ 

CACACACaXACCTlXXUCOGAACCACCrACAOCGAACTCAGATACCT 

1601 + ♦ ♦ + + 

CJGTGiaXGlKXiAACCrCGCnKXnCGATX^^ 

AGCTATCOGGTAAGCCGCAGGGTTXXUACACXUGAGCGCAaUGGGAGCr ^ 

1701 ♦ ♦ ♦ ^ 

TCCATACGCXIATTXXXXXrrcCXIAGCCTItTIXXltnXXXXnr,^^ !f} 

TCTCACITGAGCGTCGAi 1 11 lOlGATCCTCCTCACGCGGCCGGACkDCrA 

1801- ♦ ♦ ♦ ♦ O 

ACACTGAACTOSCAGCTAAAAACACTACXUCCAGTOXXXXX;^ J][ 

CCCITITGCTCACAlOilL'iriLUlGCGTTATaXXrTC 

1901 * •"" ♦ ♦ X 

CGGAAAAOGAGTVTrACAAGAAAGGACGCAATAGGGGACTAAGACACXrrAll I- 

EMP E.rep _ BE<a tM hA . rep ^ 

ACOGAGOOZAACGOCnXWGCCCCCACXXrffrcAGaXlAC^^ X 
2»1 * ♦ ♦ ♦ ^. \^ 

AOCGGTICTAAGGOCGanCTAaXXXXXXACACCXXXnxnXlAG 2 
2101 ♦ ♦ ♦ ^ ^ 

tgcccagatickxxxxxm:atcccccccc 
' ""^CGGCTitxriccocitnaxTOTrcT^ 

... ACCCCCACX;ACCCCAGiCOCACCACAAr/nr,CTr/^^ 
ACCnx;ACCACAGCCIXXn!ATX7ItXXX:ACr^ 
Cr^^CCCTltr,CTtXTA(^ 

A.\CCTGCllA;iLt;ilXlMXnXGA(XA7XX:ACA(XCAGCCCI^ <f 
2501 ♦ ♦ ^ ^ ^ 

ITRGACGACOlCCACCraiATXnTXrrACGTCTCXXnxrX^^^ X 

0CAA0GcccAax:ACACccA<rTX7nxxx:Acit:AACC(xccrCT ^ 

2601 * ♦ ♦ ♦ if) 

ACXXXnTtXXXXXXXXXnX.X;AlTXXX;ACCXX:AClTACir>lGG<TTT-AlT;A 
2701 ♦ ♦ ♦ ♦ + 

TCCCCAACCCCCCCCCClCCrACCCCTCOCGTCAATCAGTCCCCACTAC Q 
CTCTACACACTCAGOCACATCCACCCCCAGCTOXKXXXLUCAT^ 

GACATCJIXrnUGTtXXTItrrACCTOCGCCTCGACCCCCCCr^ ^ 
GCCGGAATItXXX:ACn7m:ATTC0CTCAGGTTXr^ 

2901 + ♦ ♦ + - 

CCGCCTTAACGCCTGACAAOCTAAG<X:ACritXAACACXrGGATAGC^f^^ 

CCCCATCTATtXXX;AGTr,CCAC?GOrX-CA.\ArGrrGAAllir<XTrx:A.\C^ 
yx\\ ♦ ♦ ♦ 

GciK:rAGATACGGCixi\cGcrixxxx:<x:rrixx:(;<x:iTAAA(X'/:Act7iT(xr 

AGCATrnXXX7mX;ATCACAACCAAGTXXX:GCATnTXXK:GG 
3101 ♦ ♦ ♦ ♦ ^ 
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< 

o 



6 
in 

0 



GATACTrAarCCATAACCCCCACOCCTCGGCCrCAAOCCCXX Xjl lUVii; 

♦ ♦ ♦ ♦ + 1600 

CTATCAATCGCCTATIXXXKXTiaXXAGCaXACITCXXXXr 

GCGTCAGCATTCAGAAAGCCOCACCCTICax;)UGGGAGAAAGGCaXUC 

♦ — — + noo 

IXXXIACrcGTAACltnTTCXXXXTITXXUAGCGCTIXXriCll lULGCCTG 

TtrAGGGCCAAAOoantxnxTCiTrATAG^arroitx ^ 

♦ + -4 + -f 1800 

CGrrcCCCCTITGCGGACCATAGAAATATCACCACAG(XCAAAGCGGTC^ 

TCGAAAAACGCCACCAACGCCGCXriil llACOTlUL'ltXXCl I'l'llXJli; 

♦ ♦ ♦ ♦ *isoo 

OCmrroOC O ' l U* i lG CCOCCCAAAAATCOCAACCACOCGAAAACGAC 



I CCGTATTACCGCCTTC ACrrCA CCIXL^TACXXXnXXCCGCAGCCGAACC 

t — * ♦ ♦ + 2D00 

^ GGCATAAlGGOXUAACrCACntX^ACrATCXXXIAGCGGOGICGGCI^^ 

z 

u 

y ♦ ♦ ♦ + 2100 

ACC(XXXX1ACACA(XXjAOCATCCOOGCGTAAGICVGL^ 

GAAACGTCCrcGAAACCACCCAll.lUl ILCllXiU/lilUGTACAGGlGGr 

♦ • ♦ -» ^.........4 22D0 

CI 1 ilXZAGGACCi 1 IlX,'llAX;iACACAACGACGACCAAOCATt7rcCACi:A 
ACOCGGGGAGTGTOCAGTRTrooaTroGOOCaO^ 

* " " * " " " " " * " •♦•••••••••♦•••••••••■♦.--«--. .-.4. 2100 

TCC^CCCTCACACGTCAACACXXXJiOXCGCAJJlCCC^ 
CC€7TGACCCACCmXAXXACC^AT^CAGC^CCl^^l\^ 

(2 tXXIACrcCGTOIAGCCCCTaTrAAACnxn^^ 

a71TTXX:CXTAGCOCXXXXX7rACATDGACOCGAACXXAAC\CCCCIt^^ 

ttiAr^cccATmxxccccATtnAantx^^ ^ ^ 
"'"***""* *********'♦'**•**•"•*•♦"••••"•*•••♦•-••• •------•f2tn) 

TXXXXXXXXXXXZAGCCTACCOGACGCGTnXXXTACCAOCanTAOCCC 
.CaiAATAOXXXXXXXTAACOCXXnXXXIATACATOGCOOOCTCXXXrCA 

— TGGCTTATCXXKXXXXXATiaXXXACCCT^^ 

CCAAAA,\CCrCCGCCACATCCCXnUCCAAACCGAATCXXnCCACTOiaVT 

...... 4,....,... .4..... 28X) 

ixnTmxxxxxxxxnxnAcaxiAcanTitxxTrAcocACcroAGTC^^ 

GCaXTGCTCGCGTCAGCAGACCACCTACAAAGOCGCTOCXIACXXZaJ^ 
* ♦ + 2900 

CCCTCATCCCCATCTACCTGCCCACCCOCAACCTt^ 

♦ ♦ + - ■♦■ ---•♦•300O 

GGGAGTACGOCTAGATCGACX^GCnXXXXXnTCXIACXntXX^ 

Aa;U. lC.TOCCCGAOCCCTAaXGACACCCACGritrCCCOCATtXXXAA^ 

♦ + + ♦ + 3100 

lXXj\CACAGGCCCITXXi;ATGGCClCIUX7CCACGCGOCGTAGOCGTrC 

CX7ir,GriCTACX;AGCCCACACrcAGTKXXXXX:ACirXXX^ 
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^1 \ 



w7 

ClXnrXUlXXTICTACCTOCACXUCCACCACGTnXXXACOCCIXnTC 

2roi ♦ ♦ ♦ 

CACAOCTACCAGATCCAGG li:^llA»lLt;iU: A AACXXrrCCOGACAAC 
dDPNDVD VVLLNALATP 

ATDOOCrCGAGCAGAHJO mA .1 l U OCAGCGCOCAGTACCGCAGCCA 
2801 ♦ •» ♦ ♦ 

TAGCGGAGCIXXrlL'lACCAGCXIiUCCCTCCaXXnCATXXl^^ 
d GELLDDSALPWYPLW 

TAATCAOIXGriinVlTKnOXACAOCAOnaUAC^ 
2501-- 4. 

d ATIACItXXX^CATAaiAGGCTtnXXnTXACCmiACICTATA^ 
TVPTNDSVLE LDSI ES 

TTUTGAAACACCAClXACAGCXXUCCACaXXXWXACCICT 
3001 ♦ ^ 4. 

AcrAciiJuiw,iLu.joiuxriu.iiA.uA;iiwiix:ACATGcrrc 

dS S VG AVA SC GWGG TC 
CKnPCXXXrajTACACC CUUX lTACA UXX;il ' A C 

310>-- ♦ ♦ ♦ ♦ 

GAGACOCXXXXXIATGIXXXXXXXAATCTCCGCAA^ 



CTCAOCTGCCmATtXXXJTAOGAATTXGCTGTtTItX^ 03 

3»1- ♦ ♦ ♦ Si 

CAGTCCAOXAAATAOCXXZATTXnTACCOCACACACXXnGG^ «n 

JJOK ♦ ♦ ^ 

CCTOccccciujuxxxxaxxccACAcrntrntxxxx^ ^ 
cxxxxxxuTCACcxxxxxxrrrAOOcTcxxTCa^occar^ u_ 
3«r ♦ ♦ • ♦ ♦ ^ 

CCaXXXirACTaJGCCOSGJUiTOCGACOGACCCTCXXXA^ ^ H 

ACCOCrOC UJUi A IIA ACGCTGA AI i lUlLlLt-\; AATCCAACroG h- 

3»^ ♦ ♦ * - 

imXiiGOCCGAAAAATIXrCACTrAAAaUACACACCCrrACGTrcAO > ^ 
ACACATGAGCAACITOCATAACi;] lUIJUXTOSATUrcCCrcGOCOGTT -r 'fl 

3a>— M----* ♦ ♦ ♦ ♦ i — 

TCnxnAOraTGAAGCTATrcCAAGAGOOCACCTAGACCC^ f- 

ACAGCl lUlUlUiAAGOXUTCOOGCXIAGCAGACAAGCaXTICAGGGCCO < rv 
3?D1 ♦ ♦ ♦ ♦ ♦ 2 \J 

TCTCXi^ACACACATItXXXTAOCXXXXnXXXntnTCCGGCAGT^ _ 

GCGATAGaXACItnATACTtXXnTAACTATCCXXXIATtAGAGCXGA ^ 
380h ♦ ♦ ^. 

CCCTATCCCOliACATATCACCGAATTCATACCCCtnVlCTCTCiC 

AGAAAATACCGCATCAGGOCClt HI.UA.A ILLlC U J l CACrSACTCGCTC 
390 1- ♦ ♦ ^ ^ 

ICl 11 lATGGCXnAGTXXGCGAGAACCOGAAGGAGCGAGTt^ 

TAATAOXTmTOCACAGAATCAGOOGATAACGCACCAAAGAACATUnUG 
40)1 ♦ ♦ ♦ ♦ ^ 

ATrATGCCAATAGCTCTCrrAGn:tt.t:rATri A .Gi L L l 1 ILlll/l ACACrC 

i;i 11 1 ILlATAGGClXXtXXXrca/lAOCACCATCACAAAAATCGACGCrc 
4101 ♦ ♦ ♦ ♦ + 

CAAAAAGGTATXXX;AGOCCCGCGGACT«niCCTACTCriTrrrACCT^^ 

cccxxnxx;AACciax.iLX,iocGcii;iLi.iui iLx.t ; ACccrGCCGCTrAC 

4201 ♦ ♦ ^ ^ 

GGOTACCriTOAGGGAGCACGCGAGACGACAACCCrcOCAaXXXMATO 



wo 92/21376 



PCT/US92/04538 



J<;iiUXVILXl*TCTAOC«XXXXXTACGGCCGACXXXXX^ 

■ "*■ * "*" ♦ ♦ - * 2800 

CAAtXXXACCTACATCCCaXrGCATCrODGCItXXXXXX^ 
MAEI YRRGLASARSQRYI 

lACCTCCTCAAATTCCTCGGOCACXritXXnTIACGCT^ 

•*• ♦ ♦ ♦ -f 2900 

rrcGACGACrrrAAGCACCOCCItXUaXLlGTtXXSAAOGiTC^ 

LQE FED AVMSVS PLLC RN 

TCACXXXXXACCaLiCAACTCCACACTmxti ; ^^ 

iCTU:tXtiO-lUX<7IX7nCAO(nurcACGCGGOCAGACGOC^^ ^ 

^ ^ ^ ^ ^ ^ ARGD LGS 

CAGGAGGAACACATCOOIU;! 1 ltJL.AGGACL.1 1 lU.'OGGCOCCTAAGAGCtX; 

CTCCT0CTIX7IXnACCCAGCAAAGCnxrcC^GGCCaX»^ 
V L L F V 

CTtlAATCaXXXKnACGAGOCACACAGCACaXUACITACGGAGCr^^ 
^ ♦ ♦ ♦ ♦ ♦32D0 

M.rr».Mla 

«o m 



(D 
•0 



0 



CAG(7rcAGATAOtXXXnACTCACCCltXX:AACGGCXUCACACCC^^ 

O ♦ • * ♦ ♦ ♦ ♦3300 

~ CTCCACIKITATCCCCXIATGAGIGCCMXXJI lLt.tJl*Cli;ii;iUW»CGCCro 

A(XACCAGGTC7mXAGGCTTCXXntXUACItX:AGCAC^ 
-r * * ******** ♦ • * * • * • ♦ ••• • • • " • 

-L nxx7rcxnxxACAAGCixxx;AA(xxucciTCAC CTc t: ioTA GCA U.tx 'ri tx : 

— TIXXTirGCAGOCCCItXIUXXXTITACACGCOCTCaXJI^^ 

5 ♦• * +3SD0 

CAACXLiOCnXXCOCAGOXXXrAATCTOXCGACCOC^^ 

^ CITCTXXIAACCCntTrATCrACXXnTAGTOCAAACrrcAAAOCAOGCGACT 

< CGAACArATnXXXACATAGATtXgAATCACGTrTCAA Oi 1 lUJlCCLUlA A 

2 ICGCnXUTCACGCroAAAAanOGACACATCCAGCTCaX G ACAO^ 

........ . 3300 

AGCCACTACIGOCACTriTOCAGACTCTUrACXTnXACXXXXn^ 
Cnx:ACCCGGTKinCCCCCGIGrCCCGOCX!CkCUSKli^^ 



ACJCCCCC ACAACCCOCCACAOCOCOCSCGICOGTA 

SI I CX*c,d,M 

ACTCAGAGTtXlACCATATCXXXntTItUAATAOCCCACAGATCC^ 



o 
o 

I 

5 

2 



Q 

UJ 
-J 
UJ 

O 



""" * ""' ■*■ ♦ . - ♦ . 4-3500 

TGACTCTCACCTXXnATAOCXXIACACTTr A TX X XXTU 
CXXJILXjG'IU;! lUilCCIGaXXXUGCCCrAltlAGCICACICAAACGOCG 

♦ 4. ^ 4(JJQ 

GCCAGOCAGCAAGCX^GATrATITIAOCATXXncrAGOGA Ol 1 HJLIU AGT 
CAAAAGOCCAGCAAAAGAaiAOGAACOGrAAAAAGGOOGOCTIGCnXX: 



^^4100 



IL GTmxrC GlLGU 1 ILHiOlLCrilX XI Al 1 IIIIXXX XXXIAAOGACOG 

AACTtlAGAGGTXXXXLiAACaXACACGACrATAAAGATACCACCOCTTr 

•* • ■*■ +4200 

GTIX:AGTCTCX:ACtX;CllllXXXntnTXnCAT AlliUlA TC^^ 

CGGATA Cl.ll*r a: L t L i 1 I CIC LCl ILtX XIAACaTrGCaKnTrCTCAAT 
---•♦■---------•♦•--------- + -- -- -----■♦•----.....4..- ....4 4300 
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«3> I V7 

AGGGC0CAGCAGOCaX:AaXXXXXX:AGCACAG1TCXXXXXXXXXCAAAG 
220] ♦ ♦ ♦ ♦ ♦ - 

Toccccoiunaxrcancccxxxcxnxxno u cAACxxxxxxxxx x; 



OKXTACAGaSAOGGCXACAACOCXXIAGCaiACrcrnXXXIAAAA^ 

3301--- ♦ ♦ ♦ 

GCCGAT(7ICGCTGOCGATGTrGGC0GTCXXXnGA(^ 



GIXXriXXXXXnX^GTCGCGCAGGAAOGCAGCGAGTCXXnCGOCGAC^ 
3401 ■»• ♦ ♦ 

CAGCAGGCoGACCAOCGu;iLx;i ii/xnxr.crcAOCGAGCGGcii:c;iccc 



CCCOCCAAACOGCCAAACATnCCCXXIICCATClCCM 

35D1 ♦ ♦ ♦ ♦ ♦ 

CCGG OUl A iUXGGA 1 AUl AAACOCCCACCnAGACCICTCGCAATTCCCr 



CAGGTAAAAGrrcCIXXTTAGAaXTAGTrnCIXXTriT^^ 

37D1 ♦ ♦ + 

GrXXX:ATTITCAOGAOplTCrXtXXU1CAAAA(UCCAAAO(XGG^^ 



u 



o 



AAAGGOCTACAAOG^GCaLkCAATCCACrGCTGTIXTAAO^^ 

3601 + ♦ + ♦ 

TIT0CCGO' i O i A U ,i'itUXnX7ITACGTCXXX;ACAAC A ri<KX;il^^^ ^ 



X 



X 

u 

CG OUlt:A ACCAAl C 4' AU GUCt U TACCAACOCATmXXXTOUTATCG < 

3801 + ■*■ ♦ ♦ 2 

(XCAAGATGCmGAACCAGCTAltXnTC(Xn'AAAGGCGACITATAGC 

MaUtplcCloBlagSUc 

S B 
N Kg 
EadKLrcp o c pi 

I I ■ I 

I I II 

TTGTACTIXTITGltXnXX^TCXXnYXKX^CGGGCGOGGTACCAGATCTn 

3901 ♦ ♦ ♦ 

AACATt:ACAACACCACCGTAGG<>CCGCXXXXXXXOCATGGrrCTAGAA 

S 

Stop Coadons p Begla TraaaerlpUoB Tcrmlaalor 

3 Frames I 
1 

GACOTAGITAACTAGCGTACtSATCGATCGOCAGGCATCAAATAAAACG 

CnXXIATCAATTCATCGOiTGCTAGCrcACGGTCCCTAGTITATITT^^ 
S 

S O. B 

r c e 

111 

I I I FIO. ISoA 

CAlCATO0aXXXX«IGAtCA 
itini ......... 4. ......... 4.417n 
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FIO. IS 



7aXXX7rcAGOCATGCATtX;4GGCATIGCTATCAG(XAC(XXT^ 

♦ ♦ ■♦■3300 

GGOCCACICGGTrACXnACCICOGTAACGATACTCGC^^ 



CCTGACOCCOGCOGAAGGCCCTDGAATCACrCGACTATCC^ 

+ ■*■ + ♦ +5tt) 

GCAClXXXXXXXXXnTOCGCGAGCmGIGGCXTGATAOGCITG^ 



^ TCG^CCCOGCGAACCC^TOCCCCCCrATtACX^CCACCLWXXX^ 

■*• ••••♦•35D0 

^ ACGTCX:GCax;illXX;iAGGCC(XX^TA(ntX:TGCiU.l<XUA;ii;AGAA 

0 CIXXXXnATCXXXXX^AGGAAAGACCCritXXXXIAGAACAGGAAGOGGC^ 

77 •♦• — ♦ -i-aeDo 

GAGax;ATAGO00GCUtX-iilC'iUX:AaXXX»lUli OAU;riUX XXMGT 

X ^xuGCGGcnxnasccGGGG'i'iux/iiucccG Gi lUA/i i tx :ucGGCiaxu 

b ♦ ♦ ♦ ♦ +3700 

^ COCTCGCCCACAGOCXrCCCAAGGCACXXXrCAAGCCAACGTIXXXCAGCCT 

1 

O CrcrcGTTCCGTXnTItXnTGCGCCCCriTITC 

b - + ---- ♦ - - ♦ 4.--.--.-..^ -^3800 

^ CAGAGCAACGCACAAAGCAACIGOCGGCAAAACmTCGTCGGIXnXXntnXX: 

G<X;AGCTCACCGCCAGAATCGGTG<7ITGTGGriGATGTACXJ^ 

♦ + + ♦ -^3900 

tXTTCCUCIXXXrCCTCTTAGCCACCAACACCACTACATCCA tU^ 

H 
1 

E B P E B 

DXcN vPc d CS 

rb ocm .a m I I 

R o I! I tR I I 

I M VII I II I II 

TAAATCTAGATATCCATGGATCCAGClXXIAGAATIXXUAGCrrATCCAT^ 

• + ■*■ ♦ ■♦■ ■♦■4000 

ATrrAGATCTATAGGTACCTAGGritXACGTXTrAAGCTrCGAATAGCTACAG 



AAAGGCTCAGTOCAAAGACTCGGCCrnXXriTrrATCTOnt^^ 

■* * + ♦ ♦ +4100 

TTCCGAGTCAGCl 1 IL'IGACCCGGAAAGCAAAATAGACAACAAACAGGCCG 
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Fl O. 13 B 



90 



AAGATAAAAATATATCATCATCAAC4ATAAAACii;iUlU;i'lACATAA 

+ ♦ * ♦ + 100 

TICrATITTrATATAGTAGTACritTITATITnUCAGACGA^ 

GAGGOCGOGATTAAATrcCAACATGGATCXnXUmATATGGCTATAA 

♦ ♦ ♦ + 200 

<ntXXXXXXTAATrTAACCriTCTACCrACCACTAAATATAC(X^ 

GGGAAGaXXATGCGCCAGACITGTITCrGAAACATCGCAAAGGTAGC 

+ + + ♦ ♦300 

ACCCTItr/XKTTACGCGGlCItAACAAAGACrmnAC^^ 

TITATtXXTCritXXIACCATCAAGCATITrATtXXn-AC^ 

+ ♦ ♦ -f •♦•400 

ATACGGAGAAGGCItXTTAGTrcCTAAAATAGGCATGAAGGACrACTAC 

AGAATATCCItUTICAGGTCAAAATATItnTCATGaxntXXIACTt^ 

+ + ♦ ♦ + 500 

TXnTATAGGACTAACKXIACriTrATAACAACTACGCCAOCG TX^ 

CCCrATllLt,lLiLlXTCACGaX:AATCACGAATGAATAACCCTrro 

...^ + ♦ ♦ ♦ 600 

GOXIATAAAGCAGAGCGAGTCOGOCnTACIGCTrACmT^^ 

< TCTtXSAAAGAAATCXZATAAlCTrmXXATICrcAa^ 

«n ^ + ♦ + 700 

GACC Cl 1 IL'l i lA CGTATTAGAAAACGGTAAGACTrGGOCTAAGTCAGC 

0 ATAG<7rTGTATrcATCTI«;ACCAGT0CX;AATCGCACA0CCATACCA 

— ♦ ♦ ♦ ^800 

^ rrATCCAACATAACTACAAOCnXnt:AGOCTrACCGTXnXX^ 

GAAACCKXTrrrrcAAAAATATtXnATnUTAATCCrcATATC 

1 ♦ ♦ ♦ + 900 

t lX:i'lilAXGAAAAA(7riTmTACCATAACTATrACCACTATACITATn' 

CXriTGTAACACTGGCAGAGCATrACGCTGACrrGAOGGGACXXXXKXT 

♦ ♦ ♦ + + 1000 

J£ CX:AACATIt7It;Aa.t;iOXXrrAATGCGACTCAACI XXXXn tXXtXX^ 

< cCGACAACGCAGACOGTITOnWXZAAAGCAAAACrTCAAAATCAOC 

2 ♦ + * + +1100 

GGCIXnTCCGTCrCGCAACGCACXA;i'l ltX;i 1 1 ILAACmTAGTCG 

GCTCGATGATCGGGOGATTCAGGOClXXnATtUCrcACCAACAOOT 
^ ^ ^ - + 12J0 

CCGACCTACTACOCOCCTAAGTCaXJACCATACTCAG^^ 

CATCAAAGCAli:! lUl IGACAltX-l l l 1 1 1 ILlCCGCCTAAiCl^"!^ ^ 
CTACmCCTAGAAC^ 

GAGCTACCAA Cli;i i 1 1 lUl G A AGCTAACroGCITCACCAGAGOGCAG 

+ . + + •••;•••••• 

CTCGATXKTITCAGAAAAAGGCTnXATrcAOCGAAGnXnxnt^^ 

ACnnXTTAGCAaXXXTACATACCrCCCnnXXTAATCCIOT 

+ 4. + -'+ ]SJO 

IGAGA<^TOCrrcGO(X;ATGTATGGAGCGAGAOGATrAGGACAATXXnC 
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FlO. ISA 

N 

I 

GCTACXXAAOiAAGOXlACGmTIXTIXntX^AATUnriXU 
1 ♦ ♦ ♦ 

CGATCCGTIXTITCCCTXXL^ACACAGAGTITTAGAGACrACAATG'rAACCCTG 

ACACTAATACAACGGGTCnT/aEAGOCATATnXACXXXX^^ 
101 ♦ ♦ ^ ^ . . 

TKnCATTATCnrCCCCACXATACrcCCTATAACTrCCCCITI^^ 
Kl 

I 

201 ^JIJ^J^^^C'I^OW^CATAATtncc^^ 
TACXrCAGCanATTACAGcixXnTA 

CTirXXX^TtUTUmCAGATOiGATCCTtUCACTAAACT^^ 
301- ^ ^ . . 

CAAaxnTACrACAATTTICTACICTAOCAGTCTCATI^ 

401.^?!^??,^i???f^^ 

CTAOCAATCACIXXTnUCCCTACXXXXCCI^^ 

501??!?^^^!?^?.'^™ ^ ''^ 1 i OTAATO T OLl 11 T AACACCCATC 
OOACOCaaceAAmAAOCTKMOACJU^ 5 
ClTGATKXliCTCUTTITCAlXUO^ ~ 

CAACrACCCItACTAAAACTACItXnXXXlATACCCAOCTCAC^ O 

701 

ACItUCTAOCACTAAAGAGTcUACTATrccilT^^ 

80I™™^^'^T?^^'°^^^ t 
CCTAGAACCCTACGATACCITCA*^^ ^ 

AAOCTOUUCTTAAACTAtr^ H 
TIXnTTLUTAAATOUACrmr^ 2 

♦ * ♦ ♦ . 

AACAACmTmCCTTCAAAAOCACrcAACTICCrACrK^^ 
AACTOnaiAOCTACAACAAAaTItntlATCAACOCn^^ 

"0* ♦ ♦ ♦ ^ 

cfMu f\ArM(^b£UL jg.f; s BEGIN E.RAP 

i2Di?^!!^.?!iV3J^P^^^^*^ 

GAAG I^TLLUIUIUCAGTCATC^GCK^CTC 

1X1 ^^JJ^^^^^^^^^^^^^'^^^^^^^^'^ '^^'i I »*1 iH>C CGCATt:AA 

GAALUl 1 iUl 1 11 11 10C7XX XXATCXntX;CCACCAA^ 

ATACCAAATACIXnanTCTACTtrrACXXCTACmrXKX^ 
1401- + ■*■ ♦ . 
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• Bel I 4500 
Sfl 14483 
-Sol I 4383 
Clol 4377 
HinO III 4371 
•EcoR I 4364 
Pst I 4359 
•Pvu II 4356 
•BomH I 4351 
Xbo I 3953 
Dro I 3947 
Bgl 113942 
■Kpnl3936 
Not 13928 



FIO. IS 



INhal 



•Bgl 11 2797 




1219 Spa I 



2017 Not I 



I BamH I- 

kmm 



1112 EcoRV- 
e24Clal 



3079 BomHl- 
3005 EcoR I 



Digest w/BomH I 
Ligat* w/BamHI LocZ cossetts 
Trontform E. coli 
Select blue colonies on X- gol Plates 



BomH I locZ cossette 
3064 bose pairs 



I 



•Sal I 7461 
Clo I 7455 
-HInD III 7449 
•EcoR I 7442 
•Pet I 7437 
•BamH I 7429 
■EcoR. I 7356 



INhel- 




•EcoR 
V 5462 

Clo I 5174 



BamH I 4361 

•Xbo I 3963 
Bgl II 3942 
•Kpn I 3936 
Not I 3926 



1219 Spe \- 



EcoR V 
2017 Not I 

Cla 



2797 Bgl II 



f?l JBSTITI ITE SHEET 



wo 92/21376 



PCT/US92/04S38 




X 
I 

> 
I 
I 

I 



1 
I 

I 
I 
I 

O 
I 
I 
1 

-J 
I 

< 
I 

-M 
X 



2 
O 
>- 



2 
cc 

I 



I 

! H 

X X 

7 ^ 

O 2 



I 

S 

< 
u. 

S 

> 



X z 



0 
(VI 




CM O — ^ d tu 

cuco — — -rvj^D — ^ — — 



'sz T -z -z 22 



o 

n 
o 





o 


O 


a: 


UJ 




UJ 




-J 


u. 


> 


_J 


z 


o 


o 




-J 






-J 


o 


u 


& 


X 


o 


UJ 


O 


z 




a 


_J 




a: 


i 


-J 


o: 










_j 






-J 




<o 






< 


O 


-J 


o 






O 


> 




< 




u. 


Ul 


u 


UJ 


Ui 


UJ 


O 




to 


O 


</) 


a: 


i 




o 

a: 


cc 


_i 










o 


o 


§ 


< 


o 


o 


o 








u 


UJ 




Ui 




>- 


O 










> 


•J 


< 


-J 


-1 




-J 


-J 








1- 


X 


3 






< 


u. 












UJ 






< 


> 


o 




< 


> 


> 


> 


< 


CO 


C3L 








•2- 


£ 








H 






o 






o 












^ 


•n 




! 






c 


x: 




u 


c 



! i d 



I 
I 

-J 
I 
t 

UJ 
CO 

cJ: 



I 



O 

cx: 
tc 

UJ 

•J 

UJ 

O 
u. 

GC 

< 



I — 

i cc 



CO 

«o 

2 

UJ 

to 
z 

o •n 



wo 92/21376 



PCT/LS92/04538 



FIO. >22B 

CAGOCGCTGAGACKXDCnX3AA(i;ACAGGCaiAAaXXAGCa:G0C^ 

•»••••• ♦ ♦ ♦ ♦ 100 

GGTCCAGOCACrCTCGGCACl lU-lOUUXXXnTGCCCrCCGGCGCTCC 

^ IXnXTITCACnXIACCAGCTXXliATXTIXXTIXTreAATGCX^ IC 

<\j ♦ — — — + 200 

(V4 GAGAAGlUACXnXXriXXAGCITAGACCACACTrACCGCGACCAGACAAG 

.GGGAlXXXTnXXZAACXXXXnATCOCCAGGTCAGAAGAGTCGCACAAGAG 

O ♦ + ♦ + + 300 

iZ TCXXTACGCAACXnTGGCGCATACXXXnXXAGiCAlC'l' L ACCGTCnCrC 

> aU P core 

X CIX:GGCIXXL\TCCTCTAAC7IXX;AAAGAAATrcCAGGriXXrrAG^ 

O * •*■ ♦ 500 

h- AGAGCCGACGTAGGAGATITLlCClllLUlTAACXntXAGCA'lUllL'OCCC 

< 

2 GAGAGGAGACCTAGTTCGCAACXnXXXXXUTGGGGATOGCrCAAGACrC 

' • ' ' ," * + * 600 

GCrCTCCIXnXXSATCAACCGTGCAGCGCCTACXXXTAGaJACTIXnX^ 

PsU 
I 

GGTACTACCCCCTGCAGACCrACXL\CAACAACA3XK2ACCCCGAA.CCCro 

* + ♦ ♦ 700 

CCATGATGCCCGACGTCrCGATX XiOl AmiUr ACCroCCCCITCGCAC 
YY A L Q T YD NKMDA EAW 

InlsUrt? 

ACCCGGCGAAGAAGGCAGCCCCCAGCGCCATCACCCTCGAGGAGTAC 
♦ ♦ 4. ♦ «4,800 

CClXXKX:CCCITCTIX:CCTCXXKXKrrcCCGCTACTXXX;ACCT^ 
D RAKKAAA S A IT LEE Y 
TACAGCCCGCACGOCGAGCGCCGCATCTAOCXXXTrcCTAGGrCAAGTCC ^ 

........ 4.......... + .. .......4... ........4. 

ATGTCGCaxnXKTGCCrcGCXXKXrrAGATCCGCCAOCATCCACrrCACC 
YSGIIAE RRIYPVLGEVA 

GTAGCAAGCACCa;ACTCC<XCCCXXX:ATCX:CTACAAOGTtXnXXXXX;^ 

♦ ♦ ♦ + 4- 1000 

ATLVIlCO'RXXXnXSACGCGCGGCOCTACGG A IUi'It X rAGGACCCCOC 
RKHP TA R RIIA YNV L RA 



ATCGAGCAGAAGCCAGCCGATCAGCCCCACXTrACAGCCCCrCACXXXT 



^ 1100 



TAGCTCGTXrrrCCGTCGGCTACIXXX:GCIGCATCTCCGCX3ACnXXXK;^ 
I EQ KAA DE R DVE A LT P 



CATACATCCTCGCGTTGGACGAGCCTCCCGTItXXJAGAGCrGATCGACC 
+ + + + + 12D0 

GTATC7rAGGACCGCACCrc<nXXX;AGCCCAAGCCnxntX;ACTAGCrCC 
VI LA WTSLR FGE LIEL 

GOCGriXX^GCTTCCCnCGTGCGGAACAAGATCt»'ILX;i 1 GCCAACGCCAA 

+ + + 1300 

CGGCACrGCGAAGGG<XKL\(XCCUi;ilUlAGCAGCAACXJGl'lGCGCrT 
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Fl O. 



. GTCGACCACCAA(KKXj\CrATXnxnXKnTXXKKX>(XXrGTl 

CAGCTGGriXXTITCCCXTrGGTAGAGACGAACCCGCTrc GGCAACCGGC^ 
jjjj CCGAGC<niXXCAOCCClXK:AACrcCCGGTCCAACCnTX7rcCCG<ntTAT 

CGCTCCXlAAGCXnXXXXJACGTrcAGGGCCACCrrrGGAACAGGGCCACA^ tD 

GCGCAGGCGGGGGGCIUTAilLUi'llUlCAGCATXX^AAAGTAGaiAGATCA fjj 
201 ♦ ♦ + + + ^ 

CCCXnCCCCCCCCXX3ACATAAGCAAACAGlXX*rAGCITIt:^^ O 
301 ^^^p^^ff'CCrcGAAAGAAAAATGGCCAGACCG^ ul 

AACGTCTCGGGACLl 1 lOl i 1 IIACCGCTOXTCC X 

Nde I 1 

I - 

401 TCGCIXTIXnGaXSACCACATATCGCCCGCrcAAGATACGTITrrACCOCCI $ 

ACCACAGACCCCTGCTGTATACCXrGCCCAGTrcrATCCAAAAAATGGGGG ^ 
501 TreAAGanx;AGAGTroCACAGGAGTIXX:AACCCCGTAG a:rii;ili: ACGAC K 

AACTTCGCACncnL^CCTXnXDCTCAACGn^^ 2 
Bamlll 

I 

AGCGCACOGGGAGCATCCAAGCCIXlATAOCTCAACOCCCAGGACGGTfnr.A 

, TCGCCTCCCCCTCCTAGCTTCGGAGTATGCAGITCCGCXTrcCrcC^ 

Im V ^ 

Int start? 

CCCCCCCCAGAAGCGGCrCATCGAGATGGAGACCTGGACCCClX:CACAGG 
701 ♦ * + ^. 

CCACCGCCOCCIXnrCGCCGAGTAGCTCTACClXntXJACCrCCCGAGCTfTr 
inl LAGEKRLIEMETWT PPQ 

A(XCCGAAGTGCCTCGTCCAGCCCCACCrcGCAGACGCCACCAGC(UTCrG 
801 4. ♦ + + . 

TGGCCCTTCACCGAGCACCIXXXXXrrCGAGCGTCTGCCGTGGr^^ 

TRKWLVERDLADGTRDL 

CCCTCACAGAGATGAOGCCAGCTXTOGTCCGTCaTIXXTIXXXXXXX^ 
901 + + + 

CCCAGTf7ItnxnViatXXK7ia3AGACCAa7CACGCA(^ 

VTEM TPA L VRAWWAGMG 

GGTCATCAACACAGCGCriXXJAGGACAAGCTCATOCCACAGAACCCGTCCCGG 
:00I + * + ...^ 

ccACTAcrTCTfnxitxrcAGCiccioi itjcACTACCi; iC'it:rn x xx:ACGGCc 

VMNTA VEDK LI AENP CR 

Bglll 
I 

CACGAGCICGACATCGTCGCCGCTCAGATCTItXAGCACTACCr^tATCGCC^ 
1101 ♦ + + + 

CTCCItXJACCTXTTAGCACCXXXXJACnn'AGAAGCItXnXSATGC^^ 

EELDIV AAEIFEIIYRIAA 

TTCGCCGCAAGGACATCGTXKJACGACGCC^TGAOGATCAAGCrCCGGGTGC 
12D1 + ♦ + + ^. 
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EcoOI09r3I48 
AQ.tn3089 



FIO. ^3 



XmMl2768 
ScQ.12640 




AIwNI 1689 



i278Af Ijn 



FIO. 2-4- 



AQ.tn9779 



H/MDni7609 
SphI 7603 
XbcxI 7585 
BQ.mHI7579 

Apo.17387 
Ndal 73S2 



404 Dra HI 
510 NoieT 

707 Bbcl 
707 Ncxhl 

868EcoRI 
874ScxcI 
880 ASp7J8 

880 KphI 
889 BcxmHI 
896 XbcxI 
901 Acc I 
901 HihCn 
901 ScxIT 
906.55pMI 

907 RslI 
913 SphI 

919 HinDm 

7^7 ^^*iS68EcoRI 
707 Ncxhl 874ScxcI 
880ASp7l8 
880 KpnJ 

884XmcuI 
889B<xtnHI 

II8650.C1 

2lOOStul 




CIciI 5519 
Na«.I 5463 



4605SMaBI 
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Xmn 146 58 
Seal 454 i 



AlwNr358l A\5050BA5E 

UNIQUE 

Af i ni 3170 

HinDIII28II 
PstI 2799 

XbctI2787 
BQ.mHI278I 
SHADE D PORTION = L5 DN A 




404 Dro. m 

707 Bb«I 
707 Ncxhl 

868EcoRl 
874Sa.cl 
880 ASp7l 8 
880KpnI 

884S^r,cxI 
884XmQ.I 
889 Bern HI 
939 Pstr 

I057BstXI 

l374B5a.36I 
l386Tth mi 

I6505pll 
16 57 Apcx.T 
I677Bsml 
1685 BstEl 
I764EcxgI 
1869 AUr TL 
i942BQ.II 



Xm i-ii 7608 



FIO. 

404DrQ.T 



AlwLfNI 6531 



HlnDm 5761 

Xb<aI5737 
B<xm HI 5731 



pMH-4 

+ 

8000 BASE PAIRS 
UNIQUE SITES 



707Bb«I 
707Ncxri 



l727Nd«I 
I782CIQ.I 

Z302Hr<t.Z 



Avrn 4819 
ApQ.1 4617 
Sf3 II 4600 
BSu 361 4324 



2798Bo II 
' • 3044SolI 
3226Sq.I1 

3839BQ.mHI 

SHADE D PORTION = L5 DN A 
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F-|0. -SB 

XmnT8569 

AlvwNI 7492 

HinDin 672 
PflMl 6534 

XmcJ 6149 
SmcLl 6149 
Xhol 5875 

Pa.«R7jr5875 
HimD nr 576i 

P3 115749 

Sail 5743 

yboI573 7 
BcxmHI 5731 

Sex II &3I0 



1- 



404 Dra HI 

707 Bb« I 
707 NclM 




1327Sq.II 

2075Sq.II 

2302 Nh«r 

2798BoII 

3044Sq.II 

32265q.I1 



3839BQ.mHI 
3889 Rst I 

4 324 BSU 36J 
AurII48I9 

ApQ.14617 
5q.II 4600 



Aq-IU 6044 
XmrNl5721 



Fio. 

404 Dra m 



AlwhJl 4644 




707 Bb« I 
707 Na.hl 

ll37BSlEn 



lOUE 

HinDin 3874 
Sohl 3868 
P5tl 3862 
XbcxI 3850 
BQ.mhI3844 

^pcLl 3652 

Ndcl 3647 

BSmI 3410 Nr\jl 3088 

BstX13259 
SHADE D PORTION = L5 DNA 



I728Na«l 
1755 B9III 

1784 CIqI 



2303 Nh«I 

25l4EQ.gI 

2799 Bell 



o» iQcrrm rn= SHEET 
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Fl O. 39 



43AluI 
62AluI 
238 Alul 
404 Drci m 
510 NoieT 
707 Bb«I 
707 Ndhl 



Eco 0 1091 411 1 
Aq± H 4054 

Xmhl 3731 
Alul 3613 
Pf IMI3424 
EcoNT3075 

Clal 2857 
Nl UI 2QZZ 
XhoT 2765 

PocR7 2765 

AIuI 2652 
Alul 2522 

AIuI2459-X,^i2359 
SHADE D PORTION = L5 DN A 




FIO. 31 

EcoO 1091 411 1 
AqA TE 4054 

Xmhl 3731 
ScQ.13610 



AIwNI 2680 



Af j HI 223 9 




SHADED 



8 68 Eco RI 
874 So.cI 
880 ASp7l8 

880 KphI 
889 BcxmHI 
896 XbcxI 
901 Acc I 
901 HihCH 
901 Scxll 

906 BSpMl 

907 P5tl 
913 SphI 

919 HinDin 
920AIvjiI 
942 Aful 
I037AIUI 
IIOI A lul 
l2l9AluI 
l278Af im 

I445A)UT 

15 81 Alul 

l689AlWNr 
1838 Alol 

868EcoRI 

874Sq.cI 
880 ASp7l8 

880 KphI 

889 Bam H I 
896 XbcxI 
901 Acc I 

901 Hincn 

901 Sail 
906 BSpMI 

907P5tI 
913 SphI 

919 HinDm 
1033 Pcx«R7I 
1033XhoI 
l090Nrul 
1125010.1 

134 3 EcoNl 



1692 PtMI 
1880 HinDin 
ORriON= L5 DNA 



404 Drcx HI 
510 NoicI 
707 Bb«I 
707 Ncifl 



PCr/US92/04538 



' Fl O. 30 



.A ; 



mc 61 



mc 155 



mc 1 55 
pYUB12 



mc 155 
pMH5 
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